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"tinuing, (1.41) transforms into 

1-1 2 33 

-1 10 10 9 

001 11 
00-46 

t uuc otage we interchanged rows two and three. 
The equivalent system of equations is 
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(1-42) 









-4w - Cw 


- 








-?/ -f lOz - 
* - // -f- 2z 


Z + U + V 

f lOu -f 9v 
f 3?^ + 3t> 


= 

= 
-= 


(1 43) 


Letting v = 
solution of (1 


= 2 yields in 
.39) is x = X, y 


turn u = 3, 
' - 2X, 2 = 


z = -1, ?/ 
-X, ?z = 3X, 


= 2, i = 1 
, v = -2X, - 


The most general 
-oc < X < + oc. 



From the above consideration the reader can prove by mathematical 
iduction or otherwise that a system of n linear homogeneous equations 
i 771 unknowns always possesses a nontrivial solution if ra > n. 
\ We now consider the case in = n. The system 



a]x> = t, j 1, 2, 



(1.44) 



,the unique trivial solution or 1 = x 2 = x n = if |aj| 5^ from 
|,mple 1.10. The only possibility for the existence of a nontrivial 
[tion occurs for the case |aj| = 0. Triagonalizing the matrix o 
ds a new equivalent triangular matrix \\b]\\ such that |fy| = 0. 
[reader explain this. We thus obtain 



Let 



bl 




000 



- 



;h implies 6j = (no summation) for at least one value of i. If 
= 0, we have reduced our original system to one containing more 
liowns than equations, for which a nontrivial solution exists. If 
0, bZ~\ = 0, then x n = and again we have more unknowns than 
itions so that a nontrivial solution again exists. Continuing, we see 
the vanishing of at least one element along the main diagonal 
ies the existence of a nontrivial solution. 

ample 1.23. The determinant of the coefficient matrix of the system 

x+y+z+uQ 



5x - y -f 







(1.45) 
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vanishes. Triagonalizing the coefficient matrix yields 

1111 
-3 -1 -3 
0007 
000-3 



System (1.45) becomes ?>u 7u 
?>y z 0, so that z 3A, y 

On considering the system 



so that u 0, and we have x -\- y -\- z =0, 
A, z = 2\, - o < X < + oo. 



x - y = 

2x + y = 

x + y - 



(1.46) 



we see immediately that only the trivial solution x = y = exists. 

Generally speaking, a system of m linear homogeneous equations in 
n unknowns, m > n, does not possess a, nontnvial solution. The reader 
is referred to Ferrar's text on algebra for the complete discussion of this 
case. The rank of a matrix plays an important role in discussing solu- 
tions of linear equations. For a discussion of the rank of a matrix see 
the above-mentioned text. 



Example \ .24. The column matrix X = 



JC" 



will he called a vector The number 



x } is called thejth component of X. We call the number, n, the dimensionality of the 
space. The determinant of the matrix X r X is called the square of the magnitude of 
the vector X. If the x', ? = 1,2, . . . , r?, are complex numbers, we define ]X 7 X| as 
the square of the magnitude of X, where X 7 ' = Ha; 1 ;? 2 n ||, * = conjugate com- 
plex of a?. 



The system of vectors 



X r = 



r = 1,2, 



(1.47) 



is said to be linearly dependent if there exist scalars X 1 , X 2 , . . . , X w 
not all zero such that 

XX a = (1.48) 

An equivalent definition of linear dependence is the following: The 
set of vectors (1.47) is a linearly independent system if Eq. (1.48) implies 
\i = X 2 = = \ m = 0. 
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We now prove the following theorem: Any m vectors in an n-dimen- 
sional space are linearly dependent if m > n. The proof is as follows: 
The system of equations X a X a = is equivalent to the system of n 
linear homogeneous equations in the m unknowns X 1 , X 2 , . . . , X m 

x l a \ a = i = 1, 2, . . . , n] a = 1, 2, . . . , m 
Such a system always has a nontrivial solution for m > n. Q.E.D. 



1. Solve the system 

2. Solve the system 

3. Solve the system 



Problems 

x - 2y -f 3z - 4w = 

2x+y-z+u=Q 

&r - y + 2z - u = 

x + 2/y - z - 3w = 
2.r - ;(/ + 2 + 4w = 



x - 4// - 2s 
5x + ?/ - 82 



u = 
< u o 

2w = 
3w = 



4. Determine X so that the following system will have nontrivial solutions, and 
solve : 

\x = 4r + y 
\y = -2x + y 



1 
6. Show that the vectors Xi = 1 , X 2 

2 







1 
1 are linearly independ- 





erit. Given X = 1 , find soalars X], X-2, Xs such that X 
II * 

1.6. Quadratic Forms. The square of the distance between a point 
P(x, y, z) and the origin 0(0, 0, 0) in a Euclidean space is given by 

L 2 = x 2 + y 2 + z 2 (1.49) 

using a Euclidean coordinate system. The generalization of (1.49) to 
an n-dimensional space yields 



u- = 



(1.50) 



The linear transformation x 1 = a l a y a ,i,a = 1,2, . . . , n, \a] 9* 0, yields, 
from (1.50), 

n 
t = l 
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If we desire that the ?/'s he the components of a Euclidean coordinate 
system, we need 



Comparing (1.51), (1.52) yields 

= = ! Ij* 8 ^ (153) 

P if a ^ P v ' 



t= 1 



The system of Eqs. (1.53) in matrix form becomes 

A^A = AA r = E A = ||aj|| (1.54) 

Equation (1.54) implies in turn that A r = A" 1 . A matrix A satisfying 
(1.54) is called an orthogonal matrix. If AI, A 2 are any two different 
column vectors of A, then Af A 2 = 0. We say that the vectors are 
perpendicular. 

n 

For complex components (1.50) becomes L 2 = y x'x*, and for the 

t=i 

y's to be the components of an orthogonal coordinate system we find 
that the matrix A must satisfy A T A = E or A T = A~\ where A = ||a]||, 
a* the complex conjugate of a]. If A T = A" 1 , we say that A is a unitary 
matrix. 

We now consider the quadratic form 



Q = a a px a xP a aj9 real; a, = 1, 2, . . . , n (1.55) 

and ask whether it is possible to find an orthogonal transformation 
X = BY such that (1.55) reduces to the canonical form 



Q = X,(2/') 2 (1.56) 



Let the student note that (1.55) may be written in matrix form as 

Q = X T AX (1.57) 

noting that X r AX is a matrix of just one element and so is written as 
a scalar. Under the transformation X = BY, (1.57) becomes 

Q = (BY) r A(BY) 

= Y r (B r AB)Y (1.58) 
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\i 





X 2 





X 3 





000- 


\n 



(1.59) 



that is, B^AB must be a diagonal matrix. Our problem has been reduced 
to that of finding a matrix B satisfying (1.59), arid hence satisfying 



or 



B-'AB = HA.SJ 

AB = B||X,J| 



(1.60) 



since B r = B" 1 is required if X = BY is to be an orthogonal transforma- 
tion. We may consider A to be a symmetric matrix, A = A r , since 

Q s= X r [^-(A + A T )]X + X r [-|(A - A r )]X, and X T [|(A - A r )]X = (see 
Prob. 9, Sec. 1.1), while ^(A + A. T ) is a symmetric matrix. 

We now attempt to find the square matrix B satisfying (1.60). If 
B = l|6 w ||, Eq. (1.60) becomes 



aj = / 



i,j = 1, 2, . . . ,n (1.61) 



For j = 1 we have 



or 



n 

} 



= X, 



(1.62) 



If BI is the column matrix (vector), 



, (1.62) may be written 



or 



i = XBi X = 

(A - XE)Bi = 



(1.63) 
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Equation (1 .63) represents a system of n linear homogeneous equations 
in the n unknowns comprising the column matrix BI. From Sec. 1.4 a 
necessary and sufficient condition that nontrivial solutions exist is that 



or 



|A - XE| = 



a 22 X 



a 2n 



a n X 



= 



(1.64) 



We call (1 .64) the characteristic equation of the matrix A. It is a 
polynomial equation in X of degree n, so has n roots Xi, X 2 , . . . , X n , the 
X t real or complex. The roots X], X 2 , . . . , X n determine the column 
vectors BI, 62, . . . , B n , which in turn comprise the matrix B, that is, 



B = 



B 2 



BJ< 



a square matrix 



The solution BI of ABi = X]Bi is called an eigenf unction, eigenvector, 
or characteristic vector. Xi is called the eigenvalue, or latent root, or char- 
acteristic root corresponding to the eigenfunction Bj. If B! is a solution 
of (1 .63), so is Bi/length of BI, a vector of unit length. If the B r , i = 1 , 2, 
. . . , n, are unit vectors, it is easy to prove that the matrix B is an 
orthogonal matrix. Let 



AB 2 = X 2 B 2 



X! ^ X 2 



(1.65) 



Then (ABO* 1 s= BfA 71 = BfA = XJBf so that BfAB 2 = XjBfB 2 . More- 
over Bf AB 2 = X 2 BfB 2 , so that (Xi - X 2 )B[B 2 = and B[B 2 = 0. If the 
X t , i = 1 , 2, . . . , n, are all different, the matrix B is an orthogonal 
matrix. 

Example 1.25. We now firxd the linear orthogonal transformation which transforms 
the quadratic form Q 7x 2 + 7?/ 2 -\- 7z* -f 6x/y -f- 8?/z into canonical form. We have 



Q- 

The characteristic equation is 



7 3 0| 

374 
047 



7 - X 3 

3 7 - X 4 
4 7 - X 



which reduces to (7 - X)(X - 12) (X - 2) = so that Xi = 7, X 2 = 12, X 3 = 2. For 
\i 7 Eq. (1.63) becomes 36 2 = 0, 36i -f 46. = 0, 46 2 = 0. A unit vector BI whose 
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components b 1; b> 2 , b 3 satisfy these equations is Bi = 



4 
5 


_ 3 
5 



For X 2 - 12 Eq. (1.63) 



becomes -56i + 36 2 = 0, 36i - 56 2 -f 46 3 = 0, 46 2 - 56 3 = 0, so that B 2 



5 \/2 
_1 
V2 
4 



5 \/2 



For A 3 = 2 we obtain B 3 



5 \/2 



5 V2 



so that 



43 3 








43 3 




5 5 V2 5 V2 

vl vl 

34 4 


and 







5 5 \/2 5 V2 

o 4 4 

V2 \/2 
3 4 4 


V 


5 5 A/2 5 V2 


/A A /O ,., 


i 




5 5 \/2 5 V2 
),.} 




5 i 

1 


/2 (4 V2 " 






AW) 


a a 


?/ ~ 5 ^ 
Z ~ 5^N 


/2 

-- (-3 \/2t 
/2 


/ 4 


-4, 


- 4w) 


.0 



The quadratic form Q = 7x 2 + 7y z -}- 7z 2 + GJ-?/ + Syz be(;omes 

Q = 7it 2 + 12^ 2 + 2w 2 
under the linear orthogonal transformation (J.66). 

Each root X, i = 1, 2, . . . , n, of |A XE| = determined a column 
vector of the orthogonal matrix B. If multiple roots of |A XE| =0 
occur, it appears at first glance that we cannot complete the full matrix B. 
However, we can show that an orthogonal matrix F exists such that the 
quadratic form Q = X T AX, A = A r is canonical in form for the trans- 
formation X = FW. 

First let us note the following pertinent facts: If X = BY, Y = CZ are 
orthogonal transformations, then X = (BC)Z is an orthogonal trans- 
formation. We have (BC) T = C T B T = C^B- 1 = (BC)" 1 , so that BC 
is an orthogonal matrix. In other words, the matrix product of orthog- 
onal matrices is an orthogonal matrix. Next we note that, if B is an 
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orthogonal matrix in a /^-dimensional space, k < n, then 



C = 



1 
1 


o 














o . . . 


1 





o ... 









B 







o: 





is an orthogonal matrix for the n-dimensional space. The reader can 
easily verify that C has the necessary properties for an orthogonal matrix. 
Finally the roots of |A - XE| = are the roots of |B- 1 AB - XE| = 0, 
and conversely. From B~ 1 AB - XE = B-^A - XE)B we have 

|B- 1 AB - XE| = IB-KA - XE)B| 

= JB- 1 |A - XE| |B| 

= IB- 1 ) |B| |A - XE 

= |B-'B| |A - XE| 

= |A - XE| 
which proves our statement. 

Now let Xi be any root of |A XE| = 0. It is immaterial whether Xi 
is a multiple root. We solve (A XiE)Bi = for BI, BI a unit vector. 
We now obtain an orthogonal matrix B with BI as its first column. This 
can be done as follows (the method does not yield a unique answer) : Let 
62 = \\bt2\\ be the elements of the second column of B. In order that 62 

n 

be orthogonal toBi, we need Y 6A 2 = 0. This is a single homogeneous 

1=1 
equation in the unknowns 6 t 2, i = 1, 2, . . . , n. We know that we 

n 

can find a nontrivial solution which can be normalized so that } 6 t 2 2 = 1. 

1 = 1 

n n 

To obtain the third column, we need \ 6 t i6 t3 = 0, N 6 t2 fr t3 = 0. A 

*i t=i 

normalized nontrivial solution exists for n > 2. By continuing this 
process we construct an orthogonal matrix B. The final column of B 
involves a system of n 1 equations in n unknowns for which a nontrivial 
solution exists. From the construction of B it follows that B -1 AB has Xi 
as the element of the first row and first column and has zeros elsewhere 
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in the first column. From 

(B~ 1 AB) T = B r A T (B- 1 ) r = B-'ACB 7 )- 1 = B^ACB- 1 )- 1 = B^AB 

we note that B~ ! AB is a symmetric matrix. We have used the facts 
that A = A T , B T = B" 1 . Hence under the orthogonal transformation 
X = BY, Q becomes 



Q = X r AX - Y T (B 'ABjY 

Xi 





= \\y l // 2 //"I; ' 



y l 



The characteristic roots of the matrix \\c' a p\\ 

- 



\i - \ 

6*22 ~ X 

c 32 



X 



B~ ] AB satisfy 





= 



C n2 C nn X 

Hence the remaining roots of |A XE| =0 satisfy 
c 22 X r 2 a * * ' c 2n 



X 



- 



n 

By the same procedure we can reduce Y c a py"y ft to the form 



a,^ = 3 



so that Q becomes Q = \i(z 1 ) 2 + X 2 ^ 2 ) 2 + 



with Xi = X 2 if Xi 



28 



ELEMENTS OF PURE AND APPLIED MATHEMATICS 



is a repeated root. Continuing this process reduces Q to canonical form. 
Q.E.D. 
1.6.' Positive -definite Quadratic Forms. Inverse of a Matrix. Let us 

consider the quadratic form 

Q = X r (S T S)X (1.67) 

where S is a triangular matrix, S = ||sj||, s] = for i > j. From (1.67) 



we have Q = (X 7 'S r )(SX) = (SX) r (SX) = 



1=1 



Q > unless tf Q x a = 0, i = 1, 2, . . . , n. If |sj| 
this system of equations has only the trivial solution 

= x n = 



It is obvious that 
0, we know that 






Thus the special quadratic form (1 .67) for |S| 9* has the property Q > 
unless x 1 = x 2 = = x ri = 0. If Q = X r AX > unless X = 0, we 
say that Q is a positive-definite quadratic form. For such a form it can 
be shown that 



|a n | > 



an 012 



an ai2 

a-21 022 



> 



= |A| > (1.68) 



(see "Algebra," by W. L. Ferrar, Oxford University Press). Conversely, 
(1 .68) implies that Q is positive-definite. 

The above considerations suggest that, if Q is a positive-definite form, 
Q X T AX, A = A r , then there exists a triangular matrix S such that 
S r S = A. We now show that this is true. The equation S r S = A 
implies 



I 



ij = 1,2, . . . ,n 



(1.69) 



For i = j = 1 we have s^ = an so that Su = (an)*. For i = I, j > 1 
we obtain SnSij aij since 2 i = Ssi = * " = s n i = 0, so that 



= 2, 3, . . . , n. For t = jf = 2 we have (i 2 ) 2 + (22) 2 = a 22 so that 

/ a 2 y /I a n a i2 

$22 = (a 22 *J 2 )* = I 22 -- -I = I I a 2 i a 22 

~ 



V 
I 

/ 



Remember that o# = o,-. For i = 2, j > 2 we have s^Sij + S 22 s 2 y 



LINEAR EQUATIONS, DETERMINANTS, AND MATRICES 



29 



and s<2j = (a 2j Si2Sij)/S22. Continuing this process, one obtains the 
set of equations 

n = (flu)* 



$22 = 



0H 
Ski = - - 



(1-70) 



+ ?-,.*)]> 



Given the numbers a tj , the elements s tj can be calculated step by step 
from (1.70). Let the reader show that 



(1 1 1 (1 1 2 1 3 



011 012 
021 022 



(1-71) 



The above method for determining S can be used to find the inverse 
matrix of A provided X 7> AX is positive-definite Since A = S^S, we 
have A- 1 = S^S 7 )" 1 - S^S" 1 ) 71 - To find S- 1 from S, we proceed as 
follows: Let T = S" 1 , t i3 = 0, i > j. From TS = E we obtain 



7t 

^ = 5, i,./ = 1,2, . . . ,n 



(1.72) 



Equation (1.72) enables one to solve for the t tj . Thus t^Su = 1, or 
tn = l/'&'ii. For i = 1, j = 2 we have nSi 2 + ^i2<s 22 = 612 = so that 
t\i = tnSi2/s 2 2. Continuing, we can compute / 13 , . . . , ti n , fat, ^23, 



Example 1.20. We invert the matrix 



1 2 1 J 

253 
1 3 4 



From (1.70) s n = 1, s 12 = 2, 1S = 1, s 23 = 1, s 33 A/2. From (1.72) u - 1, 

*12 = -2, ^13 = -01lSi8 + ^12fi23)/33 = l/A/2, /22 = 1, <23 = - 1 / \/2, <33 = l/\/2. 

Thus 
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and 



A' 1 - S-'(S- ] ) r 





1 -2 1 

I/ i 




1 - 


o i ' 

Vs 


5 




4 
\/2 


I 


1 




1 





V2 








1 




O 1 





V2 








I 




1 1 


1 


V2 




V2 V2 \/2 


r another method. Let us fir 




1 k 




El2 = 


1 






1 





11 -5 



3 -1 



1 -1 



has the property that Ki2B is a new matrix which can be obtained by multiplying the 
second row of B by k and adding these elements to the corresponding elements of the 
first row of B. Thus 



1 k 

P l 
1 



b\\ 612 



Let the reader show that placing k in the rth row and sth column of the unit matrix E 
produces a matrix E r , such that E r ,B is a matrix identical to B except that the elements 
b rct) a = 1, 2, . . . , n, are replaced by b ra + kb fa , r 7* s. Notice that |E r ,| = 1 for 
r 7* s. Now let A be a square matrix such that A| 5^ 0. We consider the matrix 
C - ||A, B||. For the A in Example 1.26 



121100 
253010 
134001 



(1.73) 



We manipulate C by operations on the rows until the first three rows and columns of C 
become $. These operations are equivalent to multiplying C on the left by matrices 
of the type E M discussed above. Let B be the product of all the E ra . Then 

BC - HBA, BEII . HE, B|| 

Hence BA - E, and B = A" 1 . We obtain B from ||E, B||. For example, starting 
with the C of (1.73), we multiply the first row by 2 and add to the second row, and 
subtract the first row from the third row. This yields 



121 100 
011-210 
013-101 
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We can easily obtain zeros in the first and third row of the second column. We now 
have 

10-1 5-20 

01 1-2 10 

00 2 1-11 

Adding ^ the third row to the first row and subtracting -3- the third row from the second 
row yields 

5 1 

2 2 



1 



1 - 



1 - 1 

2 2 

002 1-1 1 



Factoring 2 from the third row, multiplication by 



100 
010 

oo L 



yields the inverse matrix 



11-5 II 

1 -1 1 I 

1.7. Differential Equations. We now consider the system of differ- 
ential equations 

-7-2- = a, l a x a i, a = 1, 2, . . . , n\ a] = a{ = constants 
which can be written in the matrix form 

^ = AX A = A^ (1.74) 

We look for a linear transformation which will simplify (1.74). Let 
X = BY, so that (1.74) becomes 



(B~ 1 AB)Y 



(1.75) 



From previous considerations we have shown that it is possible to find 
a matrix B such that B~ 1 AB is diagonal (nondiagonal terms are zero). 
For this matrix B we have 



no summation on i 



(1.76) 



since 



B-'AB = 



Xj 


o . . . 





X 2 











X 3 








o . . . 


X B 
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The solution of (1.76) is 

+ D* e -V^ i = 1, 2, . . . , n 



(1.77) 



From X = BY we can solve for x*(t), i = 1, 2, . . . , n. The y\ i = 1, 2, 
. . . , n, are called normal coordinates. 

Example 1.27. Let us consider two particles of masses m\ t ra 2 , respectively, moving 
in a one-dimensional continuum, coupled in such a way that equal and opposite forces 
proportional to their distance apart act on the particles. The differential equations 
of motion are 



d Xi , 

(-JP = a(xi - 

,-Jj? .afo ~a: 2 ) 



(1.78) 



For convenience, let y\ = \/m\ xi, 2/2 = \^m 2 **> ">\ - a /mi, 

a 

so that (1.78) becomes 



a/ra 2 , 



k, 



fl 2/i 2 i 

-jgT - -?yi H- 



d*yi _ , 
df 2 " ; 

r W ~ di* I 2/2 1| I 

The characteristic equation is 

-co? - X 



*\ k 



"2 | 



y\ 

I 2/2 







AY 



so that Xi - -| { (w? -f col) + [(i - w 2 ) 2 + 4A; 2 ]i | . Let the reader find yi(t), t/ 2 (0, and 
hence xi(t), x z (t). 

1.8. Subdivision of a Matrix. A matrix A may be subdivided into 
rectangular arrays, each array in turn being thought of as a matrix. For 
example, 







2 -"1 3 4 




2 -1 


3 4 








01-12 




1 - 


-1 2 






A = 


3215 


= 


3 2 


1 5 








-1 4 6-2 




-1 4 


6 -2 








0-211 




-2 


1 1 








A, AJ 














A, A 4 || 










where 














Ai = 


2 -1 


A,-| 3 1 




3 2 

= -1 4 


A 4 = 


1 5 
6 -2 




01 














J. 


1 1 




-2 




1 1 
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The linear system 

y i = a^ i, a = 1, 2, . . . , n 
may be written 

k n 

a x<x + a X " i = 1, 2, 3, . . . , fc, * + 1, . . . , n 



In matrix form 



y 1 



y k+l 



y 



a} a\ - ' - 

2 2 

a l a 2 



- al 



a 



or 



Hence 



Xl 

Y 
2 



A 2 X 2 



A 3 Xi 



If we are not interested in solving for X M , 
eliminate X 2 from (1.79). We obtain 



(1.79) 
, x n , we can 



so that 



X 2 = A^CY, - A 3 XO |A 4 | 
Xi = (A! - A 2 A r< 1 A 3 )- 1 (Y 1 - 



In a mesh circuit the y* are the impressed voltages, the a] are the imped- 
ances, and the x l are the currents. 

dx 
1.9. Conclusion. In the calculus the solution of -r = ax, a = con- 

/7Y 

stant, is x = xoc at . Can we generalize this for the system -^- = AX? 

We see immediately that one would be led to consider matrices of the 
form 6 Ae . How should we go about defining such a matrix? From the 



calculus e* = } % n /n\. This suggests that if B is a square matrix we 
define 



n-O 



34 ELEMENTS OF PURE AND APPLIED MATHEMATICS 



E 



+ B + B 2 + . - - + B* + (1.80) 



This poses a new problem. What do we mean by the sum of an infinite 
number of matrices? We define 

B r ^E + B+~B 2 + - + ~ B' 
as the rth partial sum of (1.80). If lim B r exists in the sense that each 

r * so 

element of B r converges, we define 

e* = lim B r 

r > 

If B is a square matrix of order n, whose terms 65 are uniformly bounded, 
that is, |6j < A = constant, for i, j 1, 2, . . . , n, then lim B r 

r oo 

exists. The proof is as follows: Consider the matrix B 2 . Its elements 
are of the form b l j)f. Thus each element of B 2 in absolute value is less 
than nA 2 . The elements of B 3 are of the form blfiffi which are bounded 
by nM 3 . The elements of E k are bounded by n k ~ 1 A k . Hence every 
element of B r E is bounded by the series 



V 

Li 



nk - lAh _ 
~ ~~ 



Thus each term of B r converges since each term of B r E is a series bounded 

r 

in absolute value term by term by the elements of the series 1/n Y (n A ) k /k ! 

L*l 
* = 1 

which converges to (\/ri)e nA as r > oo . Let the reader define cos B and 
sin B. Is sin 2 B + cos 2 B = E? Let the reader also show that 



at 
for a constant matrix B. 

Problems 

1. Show that the roots of (1.64) are real if a, is real and a, = a,.. Hint: Consider 
AX XX, assume X Xi + t\ 2 , X Xi + 1X2, and show that \2 = 0. 

2. Consider the quadratic form Q = a a & a x&, the s l , aj complex. We may write 
Q =* X r AX. If A =* A T , show that Q is real by showing that Q = Q. A matrix A 
such that A A r is called a Hermitian matrix. 

3. If A and B are Hermitian (see Prob. 2), show that AB -f BA and t(AB BA) are 
Hermitian. 

4. Show that the roots of (1.64) are real if A is Hermitian. 



dt 2 "*" m dt = 


n , *Ee 


X l A- l 

v\ l-io 


B = h^ 

II 
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5. Find an orthogonal transformation which reduces 

to canonical form. 

6. Show that the characteristic roots of the matrix A are the same as those of the 
matrix B^AB. 

7. Write the system 

w -T^r + He-J7 = Ee 



in the matrix form 



where X 



Let X - CY, |C| ^ 0, and find C so that the system becomes 

d 2 Y .Hell Oi v tl -i -111 

W + ^\0 ->| Y =2U t||j 

Integrate this system of equations. 

8. Solve the system 

xi -f x t + x* + *4 - 
o-i - J-2 + x, - jr 4 = 

for xi t Xz by first eliminating xi, 0-4. 

9. Consider the system of differential equations 

d 2 X rfX 

dt* ^ A dt + BA "" U 

where A and B are constant matrices. Let 

X ss e*C! C constant 

and show that a; satisfies \w z d* + ^a] -f 6)| = if C is not the zero vector. 

10. The characteristic equation of A is the determinant |A XE| 0. This is a 
polynomial in X, written 

See Dickson, "Modern Algebraic Theories," for a proof that A satisfies its character- 
istic equation, that is, 

A n -f fciA"- 1 + 6 2 A~ 2 + - + ZuE - 
This is the HamiUon-Cayley theorem. 
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CHAPTER 2 

VECTOR ANALYSIS 



2.1. Introduction. Elementary vector analysis is a study of directed 
line segments. The reader is well aware that displacements, velocities, 
accelerations, forces, etc., require for their description a direction as well 
as a magnitude. One cannot completely describe the motion of a par- 
ticle by simply stating that a 2-lb force acts upon it. The direction of 
the applied force must be stipulated with reference to a particular coordi- 
nate system. In much the same manner the knowledge that a particle 
has a speed of 3 fps relative to a given observer does not yield all the 
pertinent information as regards the motion of the particle with respect 
to the observer. One must know the direction of motion of the particle. 

A vector, by definition, is a directed line segment. Any physical 
quantity which can be represented by a vector will also be designated as 
a vector. The length of a vector when compared with a unit of length 
will be called the magnitude of the vector. The magnitude of a vector 
is thus a scalar. A scalar differs from a vector in that no direction is 
associated with a scalar. Speed, temperature, _ 

volume, etc., including elements of the real- L a a > ?> 
number system, are examples of scalars. 

Vectors will be represented by arrows (see Fig. 
2.1), and boldface type will be used to distin- """FIG 21 

guish a vector from a scalar. The student 
will have to adopt his own notation for describing a vector in writing. 

A vector of length 1 is called a unit vector. There are an infinity 
of unit vectors since the direction of a unit vector is arbitrary. If a 
represents the length of a vector, we shall write a = |a|. If |a| s= 0, we 
say that a is a zero vector, a = 0. 

2.2. Equality of Vectors. Two vectors will be denned to be equal if, 
and only if, they are parallel, have the same sense of direction, and are 
of equal magnitude. The starting points of the vectors are immaterial. 
This does not imply that two forces which are equal will produce the 
same physical result. Our definition of equality is purely a mathematical 
definition. We write a = b if the vectors are equal. Moreover, we 
imply further that if a = b we may replace a in any vector equation by b 
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FIG. 2.2 



and, conversely, we may replace b by a. Figure 2.2 shows two vectors 

a, b which are equal to each other. 

2.3. Multiplication of a Vector by a Scalar. 
If we multiply a vector a by a real number x, 
we define the product xa. to be a new vector 
parallel to a; the magnitude of xa. is \x\ times 
the magnitude of a. If x > 0, xa is parallel to 
and has the same direction as a. If x < 0, xa. 

is parallel to and has the reverse direction of a (see Fig. 2.3). 
We note that 

x(y&) = (xy)a. = xya 
Oa = 

It is immediately evident that two vectors are parallel if, and only if, 
one of them can be written as a scalar multiple of the other. 






(atb)-f-c 



FIG. 2.3 



FIG. 2.4 



2.4. Addition of Vectors. Let us suppose we have two vectors given, 
say a and b. The vector sum, written a + b, is defined as follows: A 
triangle is constructed with a and b forming two sides of the triangle. 
The vector drawn from the starting point of a to the arrow of b is defined 
as the vector sum a + b (see Fig. 2.4). 

From Euclidean geometry we note that 



a + b = b +a 

(a + b) + c = a + (b + c) 

3 (a + b) = xa + xb 



(2.1) 

(2.2) 
(2.3) 



Furthermore, a + = a, (x + y)a. = xa + ?/a, and if a = b, c = d, then 
a + c = b + d. The reader should give geometric proofs of the above 
statements. 

Subtraction of vectors can be reduced to addition by defining 



a - b 



(-b) 
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An equivalent definition of a b is the following: We look for the vector 
c such that c + b = a. c is defined as the vector a b (see Fig. 2.5). 
2.5. Applications to Geometry. Let a, b, c, be vectors with a common 
origin whose end points A, B, C, respectively, lie on a straight line. 

A 

A 

-b 






FIG. 2.5 

Let C divide AB in the ratio x:y, x + y = 1 (see Fig. 2.6). We propose 
to determine c as a linear combination of a and b. It is evident that 

c = a + AC. But AC = xAB = x(b - a). Hence 

c = (1 - z)a + xb = y* + xb (2.4) 

Conversely, let c = ya. + xb, x + y = 1. If a, b, c have a common 
origin 0, we show that their end points lie on a straight line. We write 
c = (1 z)a + xb so that c = a + x(b - a). Since x(b a) is a 

vector parallel to the vector b a = AB, we note from the definition 
of vector addition that the end point 
C lies on the line joining A to B. 
Q.E.D. 

Equation (2.4) is very useful in 
solving geometric problems involv- 
ing ratios of line segments. 

Example 2.1. The diagonals of a paral- A \ 
lelogram bisect each other. Let ABCD be 
any parallelogram, and let be any point 
in space (see Fig. 2.7). The statement 
c d = b a defines the parallelogram 
ABCD. Hence 

i/ _i_ o\ _ a + c - b + d - I/K _i_ A\ FIG. 2.7 




The vector (a + c)/2 with origin at has its end point on the line joining AC [see 
Eq. (2.4)]. The vector (b + d) /2 with origin at has its end point on the line joining 
BD. There is only one vector from whose end point lies on these two lines, namely, 
the vector p. Hence P bisects AC and BD. 
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Example 2.2. In Fig. 2.8, D divides CB in the ratio 3 : 1 ; E divides A B in the ratio 
3:2. How does P divide CE } AD? 

Imagine vectors a, b, c, d, e, p drawn from a fixed point to the points A, B, C, D, 

E, P, respectively. From Eq. (2.4) we 
have 

c +3b _ 2a + 3b 

d __ e - 5 

Since p depends linearly on a and d, and 
also on c and e, we eliminate the vector 
b from the above two equations. This 
yields 

4d -f 2a = 5e 4- c 
or 
[ 2.8 4 d _L 2 a = 5 e _i_ l c 

The vector -d + fa with origin at must have its end point lying on the line AD. 
Similarly -|e + ^c has its end point lying on the line EC. These two vectors have 
been shown to be equal. Then 

p - |d + fa - |e + -Jc Why? 

Thus P divides A D in the ratio 2 : 1 and divides CE in the ratio 5:1. 

Problems 

1. Interpret -, 

2. a, b, c are consecutive vectors forming a triangle. What is their vector sum? 
Generalize this result. 

3. a and b are consecutive vectors of a parallelogram. Express the diagonal vectors 
in terms of a and b. 

4. a and b are not parallel. If xa. -f 2/b = la + wb, show that x I, y m. 

5. Show graphically that |a| + |b| ^ |a + b|, |a - b| ^ | |a| - |b| |. 

6. Show that the midpoints of the lines which join the midpoints of the opposite 
sides of a quadrilateral coincide. The four sides of the quadrilateral are not neces- 
sarily coplanar. 

7. Show that the medians of a triangle meet at a point P which divides each median 
in the ratio 1:2. 

8. Vectors are drawn from the center of a regular polygon to its vertices. Show 
that the vector sum is zero. "* 

9. a, b, c, d are vectors with a common origin. Find a necessary and sufficient 
condition that their end points lie in a plane. 

10. Show that, if two triangles in space are so situated that the three points of 
intersection of corresponding sides lie on a line, then the lines joining the corresponding 
vertices pass through a common point, and conversely. This is Desargues' theorem. 

2.6. Coordinate Systems. The reader is already familiar with the 
Euclidean space of three dimensions encountered in the analytic geometry 
and the calculus. The cartesian coordinate system is frequently used 
for describing the position of a point in this space. The reader, no 
doubt, also is acquainted with other coordinate systems, e.g., cylindrical 
coordinates, spherical coordinates. 
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We let i, j, k be the three unit vectors along the positive #, y, and z axes 
respectively. If r is the vector from the origin to a point P(x y y, z), then 
(see Fig. 2.9) 

r = xi + yj + zk (2.5) 

The numbers x, y, z are called the components of the vector r. Note 
that they represent the projections of the vector r on the x, y, and z axes, 
r is called the position vector of the point P. 




FIG. 2.9 

By translating the origin of any vector A to the origin of our cartesian 
coordinate system it can be easily seen that 

A = AJ + A 2 j + Ask (2.6) 

AI, A 2 , As are the projections of A on the x, y, and z axes, respectively. 
They are called the components of A. 

Let us now consider the motion of a fluid covering all of space. At 
any point P(x, y, z) the fluid will have a velocity V with components 
u, v, w. Thus 

V = u(x, y, z, t)i + v(x, y, z, t)j + w(x, y, z, t)k (2.7) 

The velocity components u, v, w will, in general, depend on the point 
P(x, y, z) and the time t. The most general vector encountered will be 
of the form given by (2.7). Equation (2.7) describes a vector field. 
If we fix t, we obtain an instantaneous view of the vector field. Every 
point in space has a vector associated with it. As time goes on, the 
vector field changes. A special case of (2.7) is the vector field 

V = u(x, y, z)i + v(x, y, z)j + w(x, y, z)k (2.8) 

Equation (2.8) describes a steady-state vector field. The vector field 
is independent of the time, but the components depend on the coordinates 
of the point P(x, y, z). 
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The simplest vector field occurs when the components of V are constant 
throughout all of space. A vector field of this type is said to be uniform. 

Example 2.3. A particle of mass ra is placed at the origin. The force of attraction 
which the mass would exert on a unit mass placed at the point P(x, y, z) is 



F . _ 



Gmr 



si 



_ _____ 

This is Newton's law of attraction. Note that F is a steady-state vector field. 
It is easily verified that if 



f , 
tnen 



A + B = (A l + 
A + yE = (xA l 



= BJ + B 2 j 

i)i + (A, + 
yBji + (xA 2 



)j + (A* + 3 )k 
yB 2 )j + (xA, + 



2.7. Scalar, or Dot, Product. We define the scalar, or dot, product of 
two vectors by the identity 



a b ss a| |b| cos 6 



(2.9) 



where 6 is the angle between the two vectors when they are drawn from 
a common origin. Since cos 6 = cos ( 0), there is no ambiguity as to 
how is chosen. 
From (2.9) it follows that 

a-b = b-a 
za-t/b = xy&'b (2.10) 

a a = |a| 2 = a 2 
a = b, c == d implies a c = b d 

If a is perpendicular to b, then a b = 0. Conversely, if a b = 0, 
|a| 7* 0, |b| p* 0, then a is perpendicular to b. 





cos0-J 
FIG. 2.10 

Now a b is equal to the projection of a onto b multiplied by the length 
of b (see Fig. 2.10). Thus 

a-b = (proj b a)|b| = (proj a b)|a| 
With this in mind we proceed to prove the distributive law, 

a- (b + c) =a-b + a-c (2.11) 
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From Fig. 2.11 it is apparent that 

a (b + c) = [proj (b + c)]|a| 

= (proja b + proj c)|a| 
= (proja b)|a| + (proja c)|a| 
= b a + c -a 
= a-b + a-c 

A repeated application of (2.11) yields 

(a + b) (c + d) = a (c + d) + b (c + d) 
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c=b-a 





a 

FIG. 2.11 FIG. 2.12 

Example 2.4. Cosine Law of Trigonometry (Fip 2.12) 

c = b - a 

c * c = (b - a) (b - a) 

= b-b + a-a-2a'b 

r 2 = ^2 _|_ 2 _ 2nb cos 6 

Example 2.5. i i = j j = k k == 1 , i j = j k = k i =0. Hence if 

A - Aii + A 2 j -f A 3 k 
B = #ii -I- B 2 j -h ^sk, then 



A B = 



+ A 2 B 2 + 



(2.12) 



Formula (2.12) is very useful. It should be memorized. 

Example 2.6. Let Ox l x*x 3 and Ox l x 2 x 3 be orthogonal rectangular cartesian coordi- 
nate systems with common origin 0. We now use the superscript convention of 
Chap. 1. The unit vectors along the x j axes, j = 1, 2, 3, are designated by i,. Simi- 
larly Iy, j = 1, 2, 3, is the set of unit vectors along the x j axes. Let a be the cosine of the 
angle between the vectors i, 10, a, = 1, 2, 3. The projections of ii on the $*, 2 , x* 
axes are aj, a*, a?, respectively. Hence ii = a}ii -f o?i 2 + ajig afl a . Let the 
reader show that 

(2.13) 

x^lft so that 
(2.14) 



We have (Fig. 2.13) r r, so that x a i - ^a. From (2.13) 

ft - 1, 2, 3 
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Equation (2.14) represents the coordinate transformation (linear) between our two 

3 3 

coordinate systems. In matrix form, X = AX. From } x l x l = ^ &x l it follows 

that X^X - X 7 'X, which in turn implies A 77 A = E or A 7 " = A" 1 . 

If U 1 , U 2 , U* are the components of a 
x 2 3 vector when referred to the x coordinate 
-i X -2 ^3 system, and if t/ 1 , U 2 , U* are the components 
the same vector when referred to the x 



i -2 -3 
(x , x , x ) 




coordinate system, one obtains 



(2.15) 



This result is obtained in exactly the same 
manner in which Eq. (2.14) was derived. 

From (2.14), ^ = a so that (2.15) maybe 



written 



(2.16) 



Equation (2.16) will be the starting point for the definition of a contravariant vector 
field (see Chap. 3). 



Problems 



-2i + 2j + 2k. Show 



1. Add and subtract the vectors a = 2i 3j + 5k, b 
that the vectors are perpendicular. 

2. Find the angle between the vectors a = 2i - 3j -f k, b = 3i - j - 2k. 

3. Let a and b be unit vectors in the xy plane making angles a and with the x axis. 
Show that a = cos a i -f- sin a j, b = cos ft i + sin j, and prove that 

cos (a j8) = cos a cos /3 -f sin a sin ft 

4. Show that the equation of a sphere with center at PQ(XQ, r/o, zo) and radius a is 
(x - x<>) 2 + (y - y ) 2 + (z- *o) 2 = a 2 . 

5. Show that the equation of the plane passing through the point PO(XO, 2/0, 20) nor- 
mal to the vector Ai -f Bj + Ck is 



A (x - 



B(y - 



- 2 ) - 



6. Show that the equation of a straight line through the point PO(XQ, 
to the vector li -f raj -f nk is 



parallel 



1. Prove that the sum of the squares of the diagonals of a parallelogram is equal to 
the sum of the squares of its sides. 

8. Show that the shortest distance from the point PQ(XQ, yo, ZQ) to the plane 

Ax + By + Cz + D - 

Axo + Byo + Czp -f- D 

(A 2 + B* + C*)i 
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9. For Eq. (2.14), Y 3*3? = Y &&. Show that this implies 

0=1 0=1 

Y aX = ? [ l a * J 

L< T = 1 if a -* 7 

10. For the a of Prob. 9 show that if U a = J^, F a = agF^, then 

3 3 

V V 

Z ^"^ 2, l aya 

a 1 a = 1 

2.8. Vector, or Cross, Product. One can construct a vector c from two 
given vectors a, b as follows : Let a and b be translated so that they have 
a common origin, and let them form 
the sides of a parallelogram of area 
A = |a| |b| sin (see Fig. 2.14). We 
define c to be perpendicular to the 
plane of this parallelogram with 
magnitude equal to the area of the 
parallelogram. The direction of c 
is obtained by rotating a into b 
(angle of rotation less than 180) and 
considering the motion of a right- 
hand screw. The vector c thus obtained is defined as the vector, or 
cross, product of a and b, written 




FIG. 2.14 



c = a X b = |a[ |b| sin 6 E 



(2.17) 



with |E 



= 1, E-a = E-b = 0. 
The vector product occurs frequently in mechanics and electricity, but 
for the present we discuss its algebraic behavior. It follows that 

a X b = -b X a 

so that the vector product is not commutative. If a and b are parallel, 
a X b = 0. Conversely, if a X b = 0, then a and b are parallel pro- 
vided |a| 9* 0, |b| 5* 0. In particular a X a = 0. 

The distributive law, aX(b + c)=aXb + aXc, can be shown to 
hold as follows: Let 



We attempt to show that u = 0. Since the distributive law for scalar 
multiplication holds, we have 



v u = v a X (b + c) - v (a X b) - v (a X c) 



(2.18) 
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for arbitrary v. In the next paragraph it will be shown that 

a (b X c) = (a x b) c 
Thus (2.18) may be written 

v u = (v X a) (b + c) - (v X a) b - (v X a) c 

= (v X a) b + (v x a) c - (v X a) b - (v X a) c 
= 

This implies that u = or v J_ u. Since v can be chosen arbitrarily, 
arid hence picked not perpendicular to u, it follows that u = so that 



c)=axb 



(2.19) 



Example2.7. Oneseesthati X i = 0, j X j 0,k X k = 0,i X j - k, j X k i, 

k X i * j- For the vectors a = aii + a^j -h #&k, b = 6ii + bzj -\- &ak we obtain 
a X b = (a 2 ba a 3 ?>2)i -f- (a^bi aib 3 )j -f- (ai6 2 a2^i)k from the distributive law. 
Symbolically 

i j k 

a X b - cii a 2 a s (2.20) 

b\ bz bz 

Equation (2.20) is to be expanded by the ordinary method of determinants. 
Example 2.8 

a i - 3j + 2k b = 4i + j - 3k 
i j k 

7i + llj + 13k 



a X b - 1 -3 2 
4 1 -3 

(a X b) a - 7 - 33 + 26 







(a X b) -b - 28 + 11 - 39 - 



Example 2.9. Rotation of a Particle. Assume that a particle is rotating about a 

fixed line L with angular speed w. We assume 
that the shortest distance of the particle from 
L remains constant. Let us define the angu- 
lar velocity of the particle as the vector, <*>, 
whose direction is along L and whose length 
is w. We choose the direction of c* in the 
usual sense of a right-hand-screw advance 
(see Fig. 2.15). Let r be the position vector 
of P with origin on the line L. It is a sim- 
ple matter for the reader to show that the 
velocity of P, say V, is parallel to, and has the 
same magnitude as, <> X r. Thus V = <*> X r. 
Example 2. 10. Motion of a Rigid Body with 
One Fixed Point. Let Oxyz be a fixed coordi- 
nate system, and OxyZ a coordinate system 
attached to the rigid body whose fixed point 
is the origin 0. Let P be a point of the rigid 
body. As time progresses, the coordinates 

x , #, 2 remain constant since the Oxijz coordinate system is rigidly attached to the 

moving frame. From (2.14) we have & ajz'. Hence 




FIG. 2.15 
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d& da! . dx' 

-*---** +a *-S 

so that^~ - -At~jx>, AX - ,*, or \\A}\\ - Ha;!!" 1 . If we define * * --Aj^ 
we have 

- -?" (2 - 21) 

3 8 

v~^ ^TA dx^ 

However, y x k x k represents the invariant distance from to P so that y x* -TT- = 

3 

y to*J 2 T* = 0. From Example 1.2 it follows that w* = wj. Equation (2.21) 



3 
and 



can now be written as 

dr 1 



^j.1 ^j2 ^3 

so that v = rr i + -TT- j + ~ir k = <*> X r, o> = w$i -h wjj -f a>jk. It follows from 

the result of Example 2.9 that the motion of a rigid body with one point fixed can be 
characterized as follows : There exists an angular velocity vector co whose components, 
in general, change with time, such that at any instant the motion of the rigid body is 
one of pure rotation with angular velocity o>. This property is very important in the 
study of the motion of a gyroscope. It can easily be shown that the most general 
rigid-body motion consists of a translation plus a pure rotation. 

2.9. Multiple Scalar and Vector Products. The triple scalar product 
a (b X c) has a simple geometric interpretation. This scalar represents 
the volume of the parallelepiped formed by the coterminous sides a, b, c, 
since 

a (b X c) = |a| |b| |c| sin 6 cos a 
= hA = volume 

where A is the area of the parallelogram with sides b and c and h is the 
altitude of the parallelepiped (see Fig. 2.16). It is easy to see that 
(a X b) c represents the same volume. Hence it is permissible to inter- 
change the dot and cross in the triple scalar product. Since there can 
be no confusion as to the meaning of a (b X c), it is usually written as 
(abc). The expression a X (b c) is meaningless. Why? We let the 
reader prove that 



a b X c = (abc) = 



(2.22) 
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From determinant theory, or otherwise, it follows that 

(abc) = (cab) = (bca) = -(bac) = -(cba) = -(acb) 

It is easy to show that a necessary and sufficient condition that a, b, c 
lie in the same plane when they have a common origin is that (abc) = 0. 
In particular, (aac) = 0. 




bxc 



FIG. 2.16 

The triple vector product a X (b X c) is an important vector. It is 
certainly a vector, call it V, since it is the vector product of a and b X c. 
We know that V is perpendicular to the vector b X c. However, b X c 
is perpendicular to the plane of b and c so that V lies in the plane of b and 
c. If b and c are not parallel, then V = Xb + juc. If b and c are parallel, 
V = 0. Since V a = 0, we have X(b a) + M(C a) = so that 

V = Xj[(c a)b - (b a)c] 



It can be shown that Xi s 1 so that 

a x (b x c) = (a c)b - (a b)c 



(2.23) 



The expansion (2.23) of a X (b X c) is often referred to as the rule of the 
middle factor. Similarly 



(a X b) x c = (a c)b - (b c)a 



(2.24) 



More complicated products can be simplified by use of the triple 
products. For example, we can expand (a X b) X (c X d) by consider- 
ing a X b as a single vector and applying (2.23). 

(a X b) X (c X d) = (a X b d)c - (a X b c)d 
= (abd)c - (abc)d 
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Also (a x b) (c X d) = [(a x b) x c] d 

= [(a c)b - (b c)a] d 
= (a-c)(b-d) - (b.c)(a 
a c a d 
b-c b-d 



d) 



Example 2. 11. 
(see Fig. 2.17). 
dius 1. Now 



Consider the spherical triangle ABC 
Let the sphere be of ra- 



B 



(a X b) (a X c) = b c - (a b)(a c) 

since a a = 1. The angle between a X b 
and a X C is the same as the dihedral angle 
A between the planes OAC and OAB, since 
a X b is perpendicular to tjie plane of OAB 
and since a X c is perpendicular to the 
plane of OAC. Hence 

sin 7 sin cos A = cos a cos 7 cos /3 
Problems 

1. Show by two methods that the vectors 
a = 3i - j + 2k, b = -12i + 4j - 8k are 
parallel. 

2. Find a unit vector perpendicular to the 
vectors a = i - j -f 2k, b = 3i + j - k. 

3. If a, b, c, d have a common origin, interpret the equation (a X b) (c X d) * 0. 

4. Write a vector equation which specifies that the plane through a and b is parallel 
to the plane through c and d. 

6. Show that d X (a X b) (a X c) = (abc)(a d). 

6. Show that (a X b) (b X c) X (c X a) = (abc) 2 . 

7. Show that a X (b X c) + b X (c X a) + c X (a X b) - 0. 

8. Find an expression for the shortest distance from the end point of the vector ri, 
to the plane passing through the end points of the vectors r 2 , r,j, 14. All four vectors 
have a common origin 0. 

9. Assume (abc) -^ 0. Let d = xa + j/b -f zc. Show that 




Fio. 2.17 



* (abc) 

10. If (abc) ^ 0, show that 
c-d 



i(adc) 
(abc) 



a X b 






" (abc) ~ " -* (abc) 
11. Consider the system of equations 



b X C 



(abd) 

(abc) 






(abc) 



-f Ciz = 



(2.25) 
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Let a oil + azj -f ask, etc., and show that (2.25) can be written as 

az + by -f cz d 
Show that - ^ etc., (abc) ^ 0. 

2.10. Differentiation of Vectors. Let us consider the vector field 
u = Ai(x, y, 2, Oi + ^ 2(3, ?/, 2, t)j + A*(x, y, z, t)k (2.26) 

At any point P(x y y, z) and at any time t, (2.26) defines a vector. If we 
keep P fixed, the vector u can still change because of the time dependence 
of its components A\, A%, A%. If we keep the time fixed, we note that 
the vector at P(z, y, z) will, in general, differ from the vector at Q(x + dx, 
y + dy, z + dz). Now, in the calculus, the student has learned how to 
find the change (or differential) of a single function of x, y, z, t. What 
difficulties do we encounter in the case of a vector? Actually none, since 
we easily note that u will change (in magnitude and/or direction) if and 
only if the components of u change. The vectors i, j, k are assumed fixed 
in space throughout the discussion. We are thus led to the following 
definition for the differential of a vector : 

rfu = dA^i + dA 2 j + dA s k (2.27) 

, , A dA t . dAi , . dAt , dA % ,. . t rt 

where dA l = - r - dx + -^ dy + -r dz + -^- dt i = 1, 2, 3 
dx dy dz dt 

If x, y, z are functions of t, then 



< 2 - 29 > 



In particular, let r == xi + yj + zk be the position vector of a moving 
particle P(x, y, z). Then 



, dv d 2 r d*x. , d*y . . dh . /001X 

and as ^ = ^=dp 1 + ^ J + ^ k (2 - 31) 

Equations (2.30) and (2.31) are, by definition, the velocity and accelera- 
tion of the particle. 
If a vector u depends on a single variable t, we can define 



du _ Um u(t + A*) - u(Q 
dt 



(see Fig. 2.18). 
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u(f+At) 




ttff) 

FIG. 2.18 

It is easy to verify that (2.32) is equivalent to (2.28). If 

u = u(x, y, z, . . .) 



du i; ufo + Ax, y,z, . . .) - u(x, y, z, . 

= lim 



then 



du dAi. dA 2 . dAi. / QQ\ 

^ = -aF 1 + -dF J + 17 k (2 ' 33) 

2.11. Differentiation Rules. Consider 



Afl - v(t) = u(< + A<) v(< + A<) - u(0 v(<) 
= (u(<) + Au) (v(<) + Av) - u(0 
= u Av + v Au + Au Av 



v(t + AO - <p(t) Av , Au . A Av 

Hence ^ - _J_J^ = u . _ + v . _ + Au . _ 



and -j- = lim ^ ^- _ w , w . 

#* AJ-+O At ar at 

d ( \ dv . du , n , x 

or -r- (u v) = u -j h v -j- (2.34) 

Equation (2.34) also can be easily obtained by writing u and v in com- 

3 33 

ponent form. Thus <p / u t v t , - = 7 u % - + / v l -rr> 

/ ^ at / j at / j at 

d<p __ dv du 

Similarly 

(u X v) = u X + X v (2 35) 

Example 2.12. Let u be a vector of magnitude u. Then u u w 8 so that 



This is a very useful result. In particular, if the magnitude of u remains constant, 
jg- and u -rr 0. This implies, in general, that -T- is perpendicular to u, if 
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|u| = constant and 
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0. The reader should give a geometric proof of this 



statement. 

Example 2.13. Motion in a Plane. Let r be the position vector of a point P 
moving in a plane having polar coordinates (r, 6) (see Fig. 2.19). Now r = rR, R a 

unit vector. Since R = cos 6 i -f sin j, we have P = -^- = sin 6 i -f cos 6 j. 

Why is P perpendicular to R? Notice that P is also a unit vector. Differentiating 
r = rR yields 

dr dr _ , dR dr _ . dR d0 



and 



dv_dV drdR^ c/rc^ ^0 dedPdO 

dt ~ dl* ~*~ dt do dt^~ dtdt ~^~ r dt 2 ^~ r dt dO dt 



so that the acceleration of the particle is 



(2.38) 



since -r- = cos i sin 0j = R. If the particle moves under the action of a 
dB 

central force field, f = /R, then ^ (r 2 ^ = so that -|-r 2 ~ = h = constant. The 

sectoral area swept out by the particle in time dt is dA = ^r 2 d0, so that r- = ^ ; and 

equal areas are swept out in equal periods of time. This is Kepler's first law of 
planetary motion. t 




FIG. 2.19 




y 



FIG. 2.20 



Example 2.14. Frenet-Serret Formulas. A three-dimensional curve, T, in a 
Euclidean space can be represented by the locus of the end point of the position 
vector given by 

r(0 - x(t)i + y(t)j -f- z()k (2.39) 

where i is a parameter ranging over a set of values to ^ t ^ fa. If is arc length along 

- 



1 from the calculus. It is natural 



the space curve, then -7- -r 
ds ds 

to define -T- as the unit tangent vector to the space curve T (see Fig. 2.20). Since 
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t = -r is a unit vector, -y- is perpendicular to t. Moreover, ~r tells us how fast the 

ds ds * * ' ds 

direction of t is changing with respect to arc length s. Hence we define the curvature, 
K, of the space curve r by * 2 = -7- -j-- **, in general, is a function of *, and hence K 
varies from point to point. The principal normal vector to the space curve F is 
defined to be the unit vector, n ; parallel to -7- Thus 

g = *n (2.40) 

The reciprocal of the curvature is called the radius of curvature, /> *= I/*. At any 
point P on r we now have two vectors t and n at right angles to each other. This 
enables us to set up a local coordinate system at P by defining a third vector at right 
angles to t and n. We define as the btnormal the vector b = t X n. The three 
fundamental vectors t, n, b form a trihedral at P; any vector associated with the 
space curve r can be written as a linear combination of t, n, b. 

Let us now evaluate -y- and -y- From b t = we obtain -7- t -f- b -r- = or 
as as as as 

_ . t = since b n = 0. Hence -y- is perpendicular to t. Since -=- is also perpendic- 

ds ds ^ ds * * 

ular to b (b is a unit vector), we see that -r- must be parallel to n. Consequently 
-r- = ^n, where r by definition is the magnitude of -7- r is called the torsion of the 

curve T. To obtain -y-> we note that n = b X t so that 
as 

*! = b X ? + X t = b X -en +m X t = -t -rb 

ds ds ds 

The famous Frenet-Serret formulas are 

dt 

3- = Kll 

ds 

*!. -(rt+rb) (2.41) 

db 



As an example of (2.41) we consider the circular helix given by 
r = a cos t i -f- a sin t j 4- bfk 

We have t * T- ( a sin ti + a cos t j -f 6k) -7-- From t t = 1 we obtain 



so that 

t (-a sin ti + a cos t j + 6k)(a* -f 6 2 )-* 

i i yj* 

Thus *n - j - ( -a cos f i - a sin t j) (a 2 -f- b 2 )" 1 , and K - a(a 2 + b 2 )- 1 , since * - ? 
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From b t X n let the reader show that b = (b am ti b cos t j + ak)(a 2 -f & s )~i, 
Q . rn - (6 cos t i + 6 sin t j)(a 2 + 6')- 1 , r - 6(a* + 6 1 )" 1 . 

Example 2.15. Pursuit Problems. Let us consider the problem of a missile M 

pursuing a target T, the motion taking place 



y 




FIG. 2.21 
Differentiating the identity r VT = rFr cos <p yields 

- (V r - VA/) VT - V T ^ (r cos 



in the xy plane. TT and VT are the posi- 
tion and velocity vectors of the target; FA/ 
and VA/ are the position and velocity vec- 
tors of the missile; 0, <p, $ are defined by 
Fig. 2.21. 

Let r = IT TA/ so that -^ VT VA/ 
at 

and r ~ = r V T - r VA/. Thus r -^ 
eu ar 

= rFT cos (p rV M cos 6, and 

^ - FT cos ? - FA/ cos (2.42) 



r cos 



(2.43) 



If t is the unit tangent vector to the curve r traversed by the target, then VT * Fyt 

rft . (see 



and 



dt A i- ^ ^Vy ^FT A , T r ^ rft 
- -n so that p- - t -f F r -57 n since -r- * 



dt 



dt 



Example 2.14). Equation (2.43) becomes 



r cos ^> - rV T - sin ^ -f V, - 



- <p) - FT ^ (r cos y?) 

+ r cos ? ~ (2.44) 



For the special case of constant target speed, ^ 0, (2.44) become 



rVr -IT sin ^ -f Fj, FA/FT cos (<p 6) FT -r. (r cos < 



(2.45) 



Let us apply (2.42), (2.45) to the dog-rabbit problem. At i - the rabbit starts at 
the origin and runs along the positive y axis with constant speed VT- A dog starts at 
(a, 0) at t = and pursues the rabbit in such a manner (direct pursuit) that = 
throughout the motion. The constant speed of the dog is Fj/. We have 



j - FT cos * - FA/ 
c* 



(2.46) 



snce 



V\i FA/FT cos <f> VT j, 
r/2. Equations (2.46) can be written as 
F r (r cos ) + y* - 



cos 



Integration yields Frr cos <p 4- FA^ (Fj, Fiv)^ -f FA/C, since r a, ^> ir/2, 
at e 0. The rabbit is caught when r - 0, or at t FA/a(Fj, - Vj.)-. 
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Problems 

1. Prove (2.35), (2.36). 

2. Show that ^ (r X ^Q - r X ^ 

3. r = a cos wt + b sin u><; a, b, u are constants. Prove that r X -r. = o>a X b and 



4. R is a unit vector in the direction r, r |r| . Show that R X dR r f r - 

5. If ^ a X a, ~ = w X b, show that ^ (a X b) - X (a X b). 

d*r 

6. If r ae"' + be~ wf , a, b, w constants, show that -5% w s r - 0. 

7. Consider the differential equation 

g+2A*? + Bu = (2.47) 

A, B constants. Assume a solution of the form u(0 = Ce wt , C a constant vector, w a 
constant scalar. Show that u(t) = Cie^i* + C2e f 2 ' is a solution of (2.47) where toi, ?z 
are roots of w; 2 4- 2Aw; -f 5 0. What if MI 1^2? 

8. Find a vector u which satisfies -gj -^ 2 -rr = such that u i, ~ j, 



9. For the space curve x 3t <*, y 3J 2 , 2 3< + t* show that 



10. Show that - - 2 t + n - *crb. 

d 8 d* 

11. Four particles on the corners of a square (sides -= 6) begin to move toward each 
other in a clockwise fashion in a direct-pursuit course. Each has constant speed V. 
Show that they move a distance 6 before contact takes place. 

12. In navigational pursuit VM sin = VT sin <p. Interpret this result geo- 
metrically, assuming 0, ^ constants. 

13. A target moves on the circumference of a circle with constant speed V. A 
missile starts at the center of this circle and pursues the target. The speed of the 
missile is also V. The pursuit is such that the center of the circle, the missile, and the 
target are collinear. Show that the target moves one-fourth of the circumference up 
to the moment of capture. 

2.12. The Gradient Let <p(z, 2/, z) be any differentiable space func- 
tion. From the calculus 

*-& b+ 5* + S* (2 - 48) 

The right-hand side of (2.48) suggests that the scalar product of two 
vectors might be involved. If r = xi + yj + zk is the position vector of 
the point P(x, y, z), then dr = dx i + dy j + dz k. Hence, to express d<? 
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as a scalar product, one need only define the vector with components 

-r^> > ~ This vector is called the gradient of (?(x. y, z), written 
ox dy dz 

del <p 3= V<p. We define 

"K l + 5 1 + S* < 2 ' 49 ' 

so that 

d<f> = dr V<? (2.50) 

The reader should recall from the calculus that d<? represents the change 
in <p as we move from P(x, y, z) to Q(x + dx, y + dy, z + dz), except 
for infinitesimals of higher order. Equation (2.50) states that this 
change in <p can be obtained by evaluating the gradient of <p at P and 
computing the scalar product of V<? and dr, dr being the vector from 
Pto Q. 

We now give a geometrical interpretation of V<p. From <p(x, y, z) we 
can form a family of surfaces <p(x, y, z) = constant. The surface S 
given by <p(x, y, z) = <p(z , 2/o, 20) contains the point P(z , 2/o, ZQ). <p(x, y, 2) 
has the constant value <f>(x$, y^ ZQ) if we remain on this particular sur- 
face S. Now let Q be any point on S near P. Since dy 0, we have, 
from (2.50), di V<p = 0. Hence V<p is perpendicular to dr. Thus V??, 
at P, is normal to all possible tangents to the surface at P so that V<p neces- 
sarily must be normal to the surface <p(x, y, z) = ^(z , 2/o, 20) at P(ZO, 2/0, 20). 
This is a highly important result and should be thoroughly understood 
by the reader. 

Let us now return to (2.50). If ds = |dr|, (2.50) states that 

^ = J-V^ = u-V^ (2.50') 

ds ds v 7 

where |u| = 1. Hence -^- = |V?>| cos 0, where is the angle between 

ds 

u and V<p. It is obvious that j- has its maximum when 8 = 0, 



The greatest change in <p(x, y, z) at P(x 0t z/ , zo) occurs in the direction of 
V<p, that is, the greatest change in <p occurs when we move normal to the 
surface <p(x, y, z) = ^>(z , 2/o, zg). This is to be expected. Let the reader 
show that V(^>i + ^2) = V^i + 



Example 2.16. Let us find a unit vector perpendicular to the surface 

x 9 - xy + yz 1 
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at the point F(l, 1,1). Here <f>(x, y, z) z 2 xy + yz, and 

V* - (2x - y)i + (* - x)J + yk 
Vv> - i -f k at P(l, 1, 1) 



Example 2.17. By direct computation Vr = r/r for r = (x* + # 2 4- z 2 )*. We 
obtain this result by a different method. The surface r constant is a sphere with 
center at the origin. Since Vr is perpendicular to the sphere, Vr * ft. Now 

dr Vr dr = ft dt = /r dr 

so that/ = 1/r. Q.E.D. 

Example 2.18. Consider V/(w), ?* = u(x, y, z). We have 



where /'(w) - - Hence V/(w) f'(u)Vu. 

Example 2.19. The operator del, V = i- hjr h k - > is a useful concept. 

da* a// o2 

It is helpful to keep in mind that V acts both as a differential operator and as a vector 
in some sense. 

ThusV, = (!+,+) = ,J? + jg + kg- ItiBea.ytoshowthat 



V(Civ>i -F C 2 ^2) = CiV<^i -h C 2 V<p 2 if Ci, C 2 are constants. Let the reader show that 

V(^i^ 2 ) = <piV<p2 -f p2V^i (2.51) 

Notice how (2.51) conforms to the rule of calculus for the derivative of a product. 

Problems 

1. Find the equation of the tangent plane to the surface xy z = 1 at the point 
(2, 1, 1). 

2. Show that V(a r) = a, where a is a constant vector. 

3. If r = (z 2 -f t/ 2 -f z*)*, show that Vr n = nr n ~ 2 r. 

4. If * (r X a) (r X b), show that V*> = b X (r X a) -f a X (r X b) when a 
and b are constant vectors. 

5. Show that the ellipse r\ + r 2 = c\ and the hyperbola ri r 2 = c 2 intersect at 
right angles when they have the same foci. 

6. Find the change of <p = x*y -f- yz* xz in the direction normal to the surface 
yx * + X y 2 -f z*y = 3 at the point P(l, 1, 1). 

7. Prove (2.51). 

8. If / /(MI, W2, . . . , Wn), w* w*(x, T/, 2), fc - 1, 2, . . . , n, show that 



n 

-/-I 



9. If > - *?(x, y, 2, 1), show that gf = ^f + ^ 
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10. The equation of an ellipse is n -f- r 2 - constant. Why is V(n + r 2 ) T ^*0 if 
T is a unit tangent to the ellipse at the point P? V(ri -f r 2 ) is computed at P. From 

V(ri + r) = Vri -h Vr 2 , Vri = > Vr 2 * give a geometric interpretation to 

V(ri + r 2 ) T = 

2.13. The Divergence of a Vector. Let us consider the motion of a 

fluid of density p(x, y, z, t), the veloc- 
ity of the fluid at any point being given 
as V = V(x, y,z, t). Let 

f = p v = Xi + Yj + Zk 

A d E We now concentrate on the flow of 
y fluid through a small parallelogram 




ABCDEFGH (Fig. 2.22) of dimensions 
dx, dy, dz. At time t let us calcu- 

F 2 22 late the amount of fluid entering the 

box through the face ABCD. The 
x and z components of the velocity contribute nothing to the flow through 

ABCD. Now Y(x, y, z, t) has the dimension j^, M = mass, L = length, 

T = time. Thus Y dx dz has the dimension MT~ l and denotes the gain 
of mass per unit time by the box because of flow through the face ABCD. 

f dY \ 

Similarly ( Y + dy 1 dx dz represents the loss of mass per unit time 

because of flow through the face EFGH at the same time t. The loss 

d Y 

of mass per unit time is thus -r dx dy dz. If we also take into consider- 

dy 

ation the other faces of the box, we find that the total loss of mass per 
unit time is 

(AY f)V A7\ 
~ + ^- + -f- ) dx dy dz (2.52) 

dx dy dz / 

dX dY d7 

Hence h T h -r- is the loss of mass per unit time per unit volume. 

dx dy dz 

r\ ILT \ \ r \ rr 

The scalar -^ h , + -z-~ is called the divergence of the vector field f , 
dx dy dz 

written 

divf = g + ^ + f (2.53) 

v *v \ 

Returning to the operator V = 1^ hJT; f-k-r->we note that 

ox dy dz 
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7 

provided we interpret V as both a vector and a differential operator. Let 
the reader show that 

V Of) = v?V f + V<f> f (2.55) 

Example 2.20. For r - xi + yj + zk, V r - 3. For f - r-r, 

V f - r~V r + Vr~ 3 r 3r~ 8 - 3r~ 4 Vr r = 3r~ 3 - Sr" 5 ! r * 

(see Example 2.17). 
Example 2.21. What is the divergence of a gradient? 



This important scalar is called the Laplacian of <p(x, t/, 2). 

VV = +f + (2-56) 



2.14. The Curl of a Vector. Let u,, i = 1, 2, 3, be the components of 
the velocity of a fluid in an x l xV Euclidean coordinate system. The 
differential change in the components of v is given by 



dv\ , 1 (dv, dVj\ dv t 

dx - dx - 



t , . 
-Bi dt (2 ' 57) 

The terms s j; = T~ ^ i,j = 1, 2, 3, now occupy our attention. The 

fy are the elements of a skew-symmetric matrix. As a result there are 
three important elements, listed as 

^ 3 ** v * 
1 " dx 1 ~ ~dx* 



2 - W ~ 5? - 

_ d?>2 dvi 

8 " dF 1 " w 
The vector ^i + t$ + tjs. is defined and called the curl of v. Using the 
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V operator, we note that the curl of v may be written 

j k 

o o 



curl v = V X v = 



Example 2.22. Let f = x z yzi 2xyz z j + y z zk. 



dz 

V* 



(2.59) 



V Xf = 



i 



- (2.7/2 - 

Example 2.23. Consider V X 
v X M 



-2xyz* y z z 



- x z t/j - (2yz z + z 2 *)k 
We have, for f = ?/i + *>j + wk, 



i J k 




dx dy dz 

<f>U <(ffl <pW 




i j k 




i J k 


a a a 




d<p dtp d<f> 


^ dx dy dz 




Jx ~dy ~dz 


u v w 




u v w 


^V X f + (V^) X f 



(2.60) 

To obtain the curl of <pf, keep <p fixed, and let V operate on f, yielding <?V X f ; then keep 
f fixed, and allow V to operate on <p, yielding V<p, and complete the vector product 
(v>) X f. The sum of these operations yields V X (<pf). 
Example 2.24. The curl of a gradient is zero. 



v x 



i j k 

JL. A A 

dx dy dz 

d<f> d<f> d<f> 

dx dy dz 

. / ay _ a 
o 



i 

J 



dz dx dx dz 



4. k I 



dx dy 



provided ^> has continuous mixed second derivatives. 

2.15. Further Properties of V Operator. We define the product u V 
to be the scalar differential operator 



We then have 



d 
**di 



d_ 
'dy 



**~te 



(2.61) 



- 



- 



dv 
,- 
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Let us now investigate u X (V X v). Assuming that we can expand 
this term by the rule of the middle factor [remember that the expansion 
a X (b X c) = (a c)b (a b)c holds true only for vectors; V, strictly 
speaking, is not a vector], we have 

u X (V X v) = V,,(u . v) - ( U V)v (2.62) 

The subscript v in the term V,,(u v) means that V,. operates only on the 
components of v. Thus 

V.(u v) = V t ,(u x r T + ?VV + */*>'*) 



Interchanging the role of u and v yields 

v X (V X u) = V tt (u v) - (v V)u (2.(>3) 

Adding (2.62) and (2.63) results in 

V M (u v) + V v (u v) = u X (V X v) + v X (V X u) + (u V)v 

+ (v V)u 
and 

V(u v) = u X (V X v) + v X (V X u) + (u - V)v + (v V)ti (2.64) 

The above analysis in no way constitutes a proof of (2.64). We leave 
it to the reader to verify (2.64) by direct expansion. The same remarks 
hold for the following examples: 

V X (u X v) = V M X (u X v) + V v X (u X v) 

= (v V)u - v(V u) + (u V)v - (V V)WL (2.65) 
V (u X v) = V w (u X v) + V, (u X v) 
= (V X u) v - V v (v X u) 
= (V X u) v - (V X v) - u (2.66) 

We now list some important identities : 

'(1) V(uv) = uVv + vVu 

t2) V (v?u) = <pV u + (V?) u 

<3) V X (<pu) = <?V X u + (V?) X u 

<4) V X (V<f>) = 

J5) V (V X u) = 

(6) V (u X v) = (V X u) v - (V X v) u 

(7) V X (u X v) = (v V)u + (u V)v - v(V u) - u(V v) 

(8) V(u v) = u X (V X v) + v X (V X u) + (u V)v + (v V)u 
<9) V X (V X u) = V(V u) - V 2 u 

(10) (u V)r = u 
V-r = 3 
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(12) V X r = 
(13) d? = dr V?> + ^ dt 

(14) dt = (dr- V)f + j t dt 
(15) V (r~ 8 r) = 

Problems 

1. Show that the divergence of a curl is zero. 

2. Find the divergence and curl of xi yj/x -f y, of x cos zi + y\nxj z 2 k. 

3. If A axi + />//j + ttzk, show that V(A r) = 2 A. 

4. Show that V 2 (l/r) - 0, r - (a; 2 + ?/ 2 4- * 2 )*. 
6. Show that V X l/(r)r] 0. 

6. If f wVt> show that f V X f 0, u not constant. 

7. Show that (v V)v - -g-Vv 2 - v X (V X v). 

8. If A is a constant unit vector show that A [V(A v) V X (v X A)] * V v. 

9. If w is a constant vector, r xi -f yj -f zk, show that V X (w X r) = 2w. 

10. Show that V (u X v) * (V X u) v - (V X v) u. 

11. Show that dt - (dr V)f -f ^ dt if f - f(x, y, z, t). 

ol 

12. Show that V X (V X u) - V(V u) - V 2 u. 

13. Let u u(x, y, z), v * v(x, y, z), and assume Vu X Vv = 0. Let dr be the 
vector from P to Q; P and Q are points on the surface u(x t y, z) constant. From 
dv ** dr Vv, show that dv = and hence that v remains constant when u is constant. 
This implies that v f(u) or F(w, v) 0. Show conversely that if u and v satisfy a 

dF dF 

relationship F(u t v) = then Vu X Vv = 0. Hiw<* vFCw, v) = T Vw -f T" ^ r<; ' 

oli oV 

dF dF 

We assume that - and do not both vanish identically. 
du dv 

14. Prove that a necessary and sufficient condition that u, v, w satisfy an equation 
F (w, v, uO *" \s that Vu Vi; X Vw 0, or 

"ch* dw dw 
dx djy dz 
dr dv dv 



*, V, 



dx dy dz 
dw dw dw 
dx dy Hz 



This determinant is called the Jacobian of (u, v, w) with respect to (a;, y, z). 

15. Let A - V X W), VV * 0, * * X(x}Y(y)Z(z). Show that A V X A - 0. 

16. Show that (V X f) (u X v) = [(u V)f] v - [(v V)f] u. 

2.16. Orthogonal Curvilinear Coordinates. Up to the present moment 
we have expressed the formulas for the gradient, divergence, curl, and 
Laplacian in the familiar rectangular cartesian coordinate system. It is 
often quite necessary to express the above quantities in other coordinate 
systems. For example, if one were to solve V 2 F = subject to the 
boundary condition that F = constant on the sphere a: 2 + y 2 + z* = a 2 , 
one would find it of great aid to express V 2 F in spherical coordinates. 
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The boundary conditions of a physical problem dictate to a great extent 
the coordinate system to be used. 

Let us now consider a spherical coordinate system (see Fig. 2.23). 
The relationships between x, y, z and r, 0, <? are 



6 = cos" 1 



+ y 2 + 



tan- 1L 



x 



and 



x r sin 6 cos 
y r sin sin < 
z = r cos 



(2.67) 



(2.68) 



, *) 



Let us note the following pertinent facts: Through any point P(x, y } 2), 
other than the origin, there pass the sphere r = constant, the cone 
6 = constant, and the plane <f> = 
constant. These surfaces intersect 
in pairs which yield three curves 
through P. The intersection of the 
sphere and the cone is a circle. 
Along this curve only <p can change, 
since r and are constant. This 
curve is called appropriately the 
<p curve. At P we construct a unit 
vector, e f , tangent to the <p curve. 
The direction of e^ is chosen in the 
direction of positive increase in <p. 
Similarly one obtains e r and e. The 
reader can easily verify that these 




FIG. 2.23 



unit tangents form an orthogonal trihedral at P such that e r X e* = e^. 
The vectors e r , e, e^, form a basis for spherical coordinates in exactly 
the same manner that i, j, k form a basis for rectangular coordinates. 
Any vector at P may be written f = / r e r + fee 9 + /*&<?, where / r , /*, f, 
are the projections of f on the vectors e r , e*, e v , respectively. Unlike 
i, j, k the vectors e r , e*, e v change directions as we move from point to 
point. Thus we may expect to find more complicated formulas arising 
when the gradient, divergence, etc., are computed in spherical coordinates. 
Since Vr is perpendicular to the surface r = constant, Vr is parallel 
to c r . Similarly V0 is parallel to e,; V^> is parallel to e,. Thus 



e r 

e* 



h r Vr 
heVB 



(2.69) 
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Let dr be a vector of length ds parallel to e r . Then dr = dr Vr [see 
(2.50)], so that h r dr = dr h r Vr = dr e r = ds. Since dr = ds, we have 
h r = 1. Now let dr be a vector of length ds parallel to e$. We have 
d0 = dr V0, ft* d0 = dr heVO = dr e$ = ds. It is seen that he is that 
factor which must be multiplied into dO to yield arc length. Thus 
h e = r, and similarly h? = r sin 0. In spherical coordinates 

ds 2 = dr 2 + r 2 dO 2 + r 2 sin 2 6 d<p z 

The positive square roots of the coefficients of dr 2 , d0 2 , d<p 2 yield h r , ho, h?, 
respectively. 

We can now write 

e r = Vr = e* X e, = r 2 sin 0V0 X V<p 

e = rV0 = e^, X e r = r sin 0V^> X Vr (2.70) 

e<p = r sin 0V^? = e r X e e = rVr X V0 



The differential of volume is 



dV = di d2 dsg = hrhehp dr dO dp 
= r 2 sin dr d0 d<^ 



It is easy to show that Vr V0 X Vv? = (hrheh^}' 1 = (r 2 sin 0)" 1 . 
Now we consider the gradient of /(r, 0, <p) . We have 






Equation (2.71) is the gradient of- a scalar in spherical coordinates. 
To compute the divergence of f = / r e r + fe&e + f^^ we write 



f = f r r z sin 0V0 X V<p + f e r sin 0V^ X Vr + f^rVr X V<^? 
from (2.70). Then 

V f = V(/ r r 2 sin 0) V0 X V^ + V(/*r sin 0) V^> X Vr + V(./ Vr X 
since V (V0 X Vv?) = 0, etc. [see formula for V (u X v)]. But 



V(/ r r 2 sin 0) V0 X V^ = (f r r 2 sin 0)V0 V0 X 

ou 



(/ ' r2 sin e) " ^ 8in 



ra; 
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We thus obtain 

V f = - T 4-^ I ~ (f r r~ sin B) + -^ (./> sin B) + - (/,r) 
r 2 sin B idr 36 d<p 

Equation (2.72) is the divergence of f in spherical coordinates. 
If we apply (2.71) and (2.72) to f = VF, we obtain 



(2.72) 



V 2 F = 



1 



r 2 sin B 



\T ( r * ?) + ( 8in ' ) + (^ I-)] (^ 
Idr \ dr / dO \ 60 / d<p \sm B d<p / J 

Equation (2.73) is the Laplacian of F in spherical coordinates. 

To obtain the curl of f , let f = f r h r Vr + feh e VB + f^V^. It can be 
easily shown that 

h r e r 

A A d 
6V 'dB 

krfr 



V X f 



(2.74) 



For the general orthogonal curvilinear coordinate system (HI, u Zj 
where ds 2 = /if du\ + hi du\ + h\ du\ we list without proof 



V-f = 



V X f = 



V 2 / = 



1 



- 



h s fz 



(2.75) 



I 1 |- I : 1 j- I 

oiii \ h\ dui/ du% \ hi du^/ du$ \ h$ du$ 



It would be a good exercise for the student to derive (2.75). 

Problems 

1. For x = r sin 6 cos ^>, 2/ = r sin sin <p t 2 = r cos show that the form 

rfs 2 - dx 2 + di/ 2 -f rfz 2 



becomes d* dr 2 + r 2 d0 2 -f r 2 sin 2 

2. For x r cos <f> t y r sin ^>, z z show that (is 2 dr 2 -f- r 2 d^> 2 

3. Express V/, V f , V X f in cylindrical coordinates and show that 
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4. Solve V'F - in spherical coordinates if V V(r). 

6. Solve V s F = in cylindrical coordinates if V F(r). 

6. Show that V X [/Mr] = 0, r a spherical coordinate. 

7. By making use of V 2 V - V(V V) - V X (V X V), find V 2 V for V - v(r)e r , V 
purely radial in spherical coordinates. 

8. Express V 2 V, for V = /(r)e r -f v(z)e 2 , in cylindrical coordinates. 
0. Consider the equation 

(X + /z)V(V S) + juV'S - p j 

\, p, p constants. Assume s =* ^ tpt Si, p constant, i * \/ 1 an d show that 

(x -f M )v(v s,) + (M + PP*)SI - 

Next show that [V 2 + (/i + PP 2 )/X -f /*](? Si) - 0, X -f M * 0. 

10. If A - V X (XT), VV - 0, ^ - R(r)Q(9)*(<p), show that A V X A - 0. Is it 
necessary that ^ - J 
3 



11. If <fc 2 - da; daf, 



1 , t/ 2 , ?/ 8 ), i - 1, 2, 3, show that 



where 

12. Derive (2.74). 

13. Derive (2.75). 



2.17. The Line Integral. We start with a vector field 

AS 




f = X(x, y, z)i + Y(x, y, z)j 

+ Z(x, y, 



Let r(0 = x(t)i + y(t)j + 
a ^ t ^ b y be a space curve T join- 
ing the points A and B with posi- 
tion vectors r(a), r(6). One may 
subdivide T into n parts by sub- 
dividing t into 



Fir,. 2.24 



(see Fig. 2.24). Lot Ar, = r(^) 

that tj-i ^ , ^ tj. We can compute #(,), 

vector f(&). One then forms the sum 



t 3 < ' < t n = b 

,_i), and let , be any number such 
;j)> *(;)> which yields the 



(2.76) 



If lira S. exists independent of how the & are chosen provided maximum 
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| AT, | > as n - oo , we define this limit to be the line integral of f along 
the curve T from A to B. 

As in the calculus the limit is written 



If f is continuous along T and if F has continuous turning tangents, that 

is, -T- is continuous, then (2.77) exists from Riemann integration theory 
ds 

and can be written 



* - f b i v\ dx ^ v\ d u 

A Ja \ dt ' dt 

It (2.77') 

We use (2.77') as a means of evaluating the line integral. There will be 
some vector fields for which the line integral from A to B will be inde- 
pendent of the curve T joining A and B. Such vector fields are said 
to be conservative. If f is a force field, (2.77) defines the work done by 
the force field as one moves a unit particle (mass or charge) from A to J5. 
We now work out a few examples and then take up the case of conserva- 
tive vector fields. 

Example 2.25. Let f xyi + zj xyzk, and let the path of integration be the 
curve x = t, y = J 2 , z /, the integration performed from the origin to the point 
P(l, 1, 1), the range of t being given by ^ t ^ 1. Along the curve, f = f*i + tj ( 4 k, 

and dr - dt - (i + 2tj 4- k) dt so that f rfr - (<* + 2/ 2 - t 4 ) dt. Hence 

rr> 

-dl - 

If we choose the straight-line path x = t, y t y z = t from the origin to the point 

/" P F 1 

P(l, 1, 1), we obtain / f dr / (< 2 -f- < < 3 ) eft = T V It is seen that the vector 
field f is not conservative. 

Example 2.26. Let f x*i + y*j, and let the path of integration be the parabola 
y x* from (0, 0) to (1, 1). Let x - t so that y J 2 , ^ J ^ 1, and 

1 f l 



For the same f let us compute the line integral by moving along the x axis from x 
to x J and then moving along the line x 1 from t/ to y 1. Although the 
continuous curve does not have a continuous tangent at the point (1, 0), we need not 
be concerned since one point of discontinuity does not affec,t the Riemann integral 
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provided -j-. is bounded in the neighborhood of the discontinuity. We have 



mi) f(i,o) 

/ f di = / f dr 

7(0,0; 7(o,o) 



. 
+ / f dr 



Along the first part of the curve x = t, y 0, dx = dt, dy = 0. Along the second 
part of the curve x = 1, dx 0, y t, dy = dt, and 

/(i.i) n ri ~ 

\ t dr = / t z dt + t 3 dt = Tg- 

7(o,o) 7o 7o 12 

We become suspicious and guess that f is conservative. Notice that 

(j-3 y4 \ j3 ? ,4 

-5- 4- '-r 4- constant I = V> ^> = -5--f- 1 T-|- constant 
34 / 34 

Hence f dr = V<^ dr = d<? so that 

- vU) 



/B 
f dr depends only upon the upper and lower 

limits and is independent of the path of integration from A to B. 

Example 2.27. We have just seen from Example 2.26 that if f = V>, <p single- 
valued, then f is a conservative vector field. Conversely, let us assume that /f dr is 
independent of the path. We show that f is the gradient of a scalar. Define 



., 

<p(x, ?/, 2) = / f dr 

JPo(xn,yn,zo) 

The value of <f> depends only on the upper limit (we keep Po fixed). Then 

fQ(x + Ax,y,z) 
<p(x -f Ax, y, z) = \ f dr 

J P(ro,yo,zo) 



and sp(x 4- Ax, i/, 2) <^(x, y, 2) = / ' X(x, //, 2) dx 4~ Y(x, y, 2) dy 

yp(x,y,2) 

4- Z(x, y, z} dz 

We choose the straight line path from P to Q as our curve of integration. Then 
dy ** dz *=* 0, and 

/x-f Ax 
X(x, y, z) dx 

Applying the theorem of the mean for integrals yields 

<p(x -f Ax, y, z) - <p(x, y, z) = X(, y, z) Ax x ^ ^ x -f Ax 
so that T^ lim 1 -'- L - tL ^ ' ' J == lim A'(. ?/. z) = X(x. y. 2) 



assuming X continuous. Similarly Y * -^, 7 = -^, so that 

d?y dz 

,. + y J + ft.i +j+k Q.E.D. 
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A quick test to determine whether f is conservative is the following: 
Note that, if f = V<p, then V X f = 0. Conversely, assume V X f = 0. 
Then for f = Xi + Yj + Zk we have 

dX = dF 

dy dx 

w - f 

dZ = dX 

dx dz 

Let 

<p(x, y, z) = / X(x, y,z) dx + f y F(x , y, z) dy 

Jxo Jyo 

Z(x,, 2/0, z) dz (2.79) 



Jzo 

We now show that f = V<p. From the calculus 



= X(x, y, z) 



Y(X ' y ' z} - 



dx 



= Y(x, y, z) - Y(x, y, 2) + Y(x , y, z) 
= Y(x, y, z) 

dX , . v dY 



= Z(x, y, z) - Z(x<>, y, z) + Z(x, y, z) - Z(x , ye, z) + Z(x<>, y , z) 
= Z(x, y, z) 

Hence f = Xi + Yj + Zk = gi + |? j + gk = V v 

The constants x , y^ ZQ can be chosen arbitrarily. 

We have proved that a necessary and sufficient condition that f be the 
gradient of a scalar is that V X f = 0. Thus for f conservative we have 
f = V<p or V X f = 0. We also say that such an f is an irrotational 
vector field. 

Example 2.28. It is easy to show that f = 2xye*i + jVj + x*ye*k is irrotational. 
Then 

<p(x, y, z) - f* 2xye' dx + fj We* dy + f* O 2 - e* dz 

= x 2 2/e 
and f = V(x z ye* -f constant) 
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Problems 

1. Given f e*i xyj, evaluate /f dr along the curve y x 8 from the origin to 
the point (2, 8). 

2. f (y + sin z)i + zj + x cos zk. Show that f is conservative, and find <f> so 
that f V*>. 

3. Let f j/i -f xj. Evaluate Jf di around the circle with center at the origin 
and radius a. 

4. Show that if f = V<f>, <p single-valued, the line integral around a closed path, 
written fi dr, vanishes. Prove the converse. 

5. Let f - (-j/i -j- xj)/x 2 -f y*. Show that f * V tan~ ! (y/x) and integrate f 
around any closed path surrounding the origin, and show that for this path 

ft di - 2ir 

Why does this integral not vanish? See Prob. 4. Notice that f is not defined at the 
origin. The curve of integration contains the origin in its interior. This will be 
important in complex-variable theory. 

6. If A is a constant vector, why is it true that ^A dr = 0? 

2.18. Stokes' s Theorem. We begin by studying the locus of the end 
points of the vector 

r = x(u, v)i + y(u, v)j + z(u, v}k (2.80) 

where u and v range over a continuous set of values and x, y, z are assumed 
to have continuous partial derivatives in u and v. For a fixed v = VQ 
the end points of r trace a space curve as we let u vary continuously. 
For each v a space curve exists, and if we let u vary, we obtain a locus 
of space curves which collectively form a surface. The curves obtained 
by setting v = constant are called the u curves, and the curves obtained 
by setting u = constant are called the v curves. We thus have a two- 
parameter family of curves forming the surface. 

A simple example will illustrate what we have been talking about. 
Consider 

r = constant 

r = r sin 6 cos <p i + r sin 9 sin v j + r cos 6 k g <p g 2* > (2.81) 

g TT 

We use 6 and <p instead of u and v. Let us notice that r r 
stant, so that the end points of the vector r lie on a sphere of radius r. 
For a fixed 6 = the z component of r, namely, r cos , is constant. 
For ^ <? ^ 2* the end points of r trace out a circle of latitude. The <f> 
curves are thus circles of latitude. It is easy to show that the 6 curves 

are the meridians of longitude. We can show that the 6 curves intersect 

\_ 
the <p curves orthogonally. The expression represents a vector tangent 

to a 6 curve, while represents a vector tangent to a ^ curve. From 
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we have 



\_ 

= r cos cos <p i + r cos sin ^> j r sin k 
ou 

= r sin sin ^? i + r sin cos ^ j 
d^> 

- * RD - 



If we move from a point P(u, v) to the point Q(u + rfw, v + dv), P and 



_ > n-. 

on the surface, then tfr = PQ = 



\-. 

+ cfo. 
dv 



j 
= rfr 



dr dr . 9 , ft dr dr , 
= dn* + 2 rf?/ 
- 



Hence 



. dr ar 

+ -r- 
a?' a?; 



u curve 



where ds is arc length. For the sphere ds 2 = r 2 dO 2 + r 2 sin 2 

Let us now consider a surface of the type given by (2.80) bounded by 
a rectifiable curve F that lies on the surface (see Fig. 2.25). 

The vector X is perpendicular to the surface since and -r- 
du dr du dv 

are tangent to the surface. As we move along the curve F, keeping 
our head in the same direction as 

T- X -r-> we keep track of the area 

du dv r 

to our left. It is this surface, *S, 
with which we keep in touch. F 
will be called the boundary of >S. 
We neglect the rest of the surface 
r(u, v). We now consider a mesh 
on the surface formed by a collec- 
tion of parametric curves (the u and , 
v curves). The mesh will be taken a 
fine enough so that, if (u, v) are the 




v curve 



FIG. 2.25 



coordinates of A, then (u + du, v), (u + du, v + dv), (u, v + dv) are the 
coordinates of B, C, D, respectively (Fig. 2.25). Now consider 

ABCD ftdT 

The value of f at A is f (u, v) ; at B it is f(u + du, r) ; at C it is f (u + du, 
v + dv) ; at D it is t(u, v + dv). Now 

f(u + du, v) = f(u, v) + df u 

= f(u, v) + du 
du 



TT ' V f 



72 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

except for infinitesimals of higher order. Similarly 

f (ti, v + dv) = f (w, v\ + dv f | V J f 
Hence, but for infinitesimals of higher order, 

. v f. dv 

ABCD 



? .v)t\. 

dv / j du 



du 



= (V Xf)- Xdudy, 
v ' du dv \ 

(see Prob. 16, Sec. 2.15). 

v r\- 

The vector -r- du X -r- rfy is normal to the surface S. Its magnitude 
du dv 

is the area of the sector A BCD, except for infinitesimals of higher order, 
since ABCD is, strictly speaking, not a parallelogram. We define 

d6= ^X^dudv (2.82) 

du dv 

so that (> f dr = V X f dd 

j ABCD 

except for infinitesimals of higher order. 

We now sum over the entire network. Interior line integrals cancel 

out in pairs, leaving only & f dr. Also 



(V X f ) dtf -* (V X f ) rfd 

as the mesh gets finer and finer. We thus have Stokes's theorem 

< r f.rfr =fl (V X f)-d<J (2.83) 

^ 
Corwmente 

1. Since r may not be a parametric curve, (2.82) may not hold for a mesh circuit 
containing r as part of its boundary. This is true, but fortunately we need not worry 
about the inequality. The line integrals cancel out in pairs no matter what sub- 
division we use, and for a fine network the contributions of those areas next to r con- 

tribute little to / / V X f dd. The limiting process takes care of this apparent 
negligence. 



VECTOR ANALYSIS 73 

2. Stokes's theorem has been proved for a surface of the type r(w, v) [see (2.80)]. 
The theorem is easily seen to be true if we have a finite number of these surfaces con- 
nected continuously (edges). The case of an infinite number of edges requires further 
consideration, 

3. Stokes's theorem is also true for a surface containing a conical point where no dti 
can be defined. We just neglect to integrate over a small area covering this point. 
Since the area can be made arbitrarily small, it cannot affect the integral. 

4. The reader is referred to the text of Kellogg, "Foundations of Potential Theory," 
for a more rigorous proof of Stokes's theorem. 

5. The tremendous importance of Stokes's theorem cannot be overemphasized. It 
relates a line integral to a surface integral, and conversely. 

6. In order to apply Stokes's theorem it is necessary that V X f exist and be 
integrable over the surface S. 

Examples of Stokes's Theorem 

Example 2.29. Let f = yi -f -rj, and let us evaluate (p f dr, F any rectifiable 
curve in the xy plane. Applying Stokes's theorem, we have 

(f) f - di = //(V X f) k dy dx - //2k - k dy dx - 2A 

8 S 

Thus the area A , bounded by F, is 

A = \fx dy - y dx (2.84) 

For the ellipse x = a cos t, y - b sin /, ^ t ^ 2w, we have dx a sin t dt, 
dy - b cos t dt, and A ^ I a6(cos 2 t + sin 2 1) dt = irab. 

Example 2.30. If V X f = everywhere, it follows from Stokes's theorem that 
fi di = around every closed path. Conversely, assume $i dr = around every 
closed path, and assume V X f is continuous. If V X f ^ 0, then V X f T* at some 
point P. From continuity we have V X f ^ in some neighborhood of P and V X f 
nearly parallel to (V X f)p in this neighborhood. Choose a small plane surface S 
through P with boundary F in this neighborhood of P. The normal to the plane is 

chosen parallel to (V X f)p. Then (p f dr = // V X f dd > 0, a contradiction. 

8 

An irrotational field is characterized by any of the three conditions 

(i) f = V^ 

(ii) V X f = (2.85) 

(iii) i dr = for every closed path 

Any of these conditions implies the other two. 

Example 2.31. Let f = f(x, y, z)a, where a is any constant vector. Applying 
Stokes's theorem yields 



/a-dr - /Tv X (/a)-dd 

S 

<f)fdT - (I vf X a rfd - a (f dd X V/ 
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so that a f (Dfdr // dd X V/J 0. Since a is arbitrary, it follows that 

(f)fdi - // dd X Vf (2.86) 

It can be shown that 

r * f = // (rfd X V) * f (2.87) 



The asterisk can denote scalar, vector, or ordinary multiplication. In 
the latter case f becomes the scalar /. 

Problems 

1. Show that (h r dr = by two methods. 

2. Show that (p a X r dr ** 2a // dd if a is constant. 

s 

5. By Stokes's theorem show that V X VSP 0. 

4. Prove that (D uVv dr * // Vw X Vv dd. 

5 

6. Prove that fuVv dr fvVu dr. 

6. If f cos // i 4- x(l -f- sin //)j, find the value of ^f dr around the circle with 
center at the origin and radius r. 

7. If (f) E dr - - - // B dd for all surfaces S show that V X E - - - ~ 

J C ot JJ C ot 

S 

8. Show that II V X f dd * if S is a closed surface. 

S 

9. f - (# 2 y/ 2 )i -f 2-r^/j. Find y*f dr around the square with vertices at 
(0, 0), (1, 0), (1, 1), (0, 1). Do this by two methods. 

10. If a vector is normal to a surface at every point, show that its curl is zero or is 
tangent to the surface at each point. 

11. Let f = a X g, a any constant vector. Apply Stokes's theorem to f, and show 



that <T)dT X g - JJ (dd X V) X g. 



S 

12. Assume V X f & 0. Show that, if a scalar /(*, y, *0 exists such that 

V X W ) - 
then f V X f - 0. 

13. Show that | dt X r| taken around a closed curve in the xy plane is twice the 
area enclosed by the curve. 

14. Let f X(x, y}i -f Y(x, y)j, and let S be the area bounded by the closed curves 
Fi and T* lying on the xy plane, Fi interior to F 2 . Show that 



// 



f-dd - ()t -dt -< f-dr 



Both line integrals are taken in the counterclockwise sense. 
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2.19. The Divergence Theorem (Gauss). Let S be a closed surface 
containing the volume F in its interior. We assume S has a well-defined 
normal almost everywhere. We now subdivide the volume into many 
elementary volumes. From Sec. 2.13 we note that except for infini- 
tesimals of higher order 

II f dd = V f AT 

AS 

where A$ is the entire surface bounding the elementary volume AT. If 
we sum over all volumes and pass to the limit as the maximum AT 0, 
we obtain 

" f -dd = /// V-fdT (2.88) 



Iff 



Equation (2.88) is the divergence theorem of Gauss. It relates a 
surface integral to a volume integral. It has tremendous applications 
to the various fields of science. In the derivation of (2.88) use has been 
made of the fact that for each internal dd there is a dd, so that all 
interior surface integrals cancel in pairs, leaving only the boundary 
surface S as a contributing factor. 

Equation (2.88) may be interpreted as follows: Any vector field f may 
be looked upon as representing the flow of a fluid, f = pv. From Sec.. 
2.13 V f represents the loss of fluid per unit volume per unit time. The 

total loss of fluid per unit time throughout V is f/J V f dT. Now if 

f and V f are continuous in F, there cannot be any sources or sinks in F 
which would create or destroy matter. Consequently the total loss of 
fluid per unit time must be due to the fluid leaving the surface S. We 
might station a great many observers on the boundary S, let each observer 
measure the outward flow of fluid, and then sum up their recorded data. 
At a point on the surface with normal vector area dd the component 
of the velocity normal to the surface is v N, and pv dd = f dd repre- 
sents the outward flow of mass per unit time. The total loss of mass 

per unit time is 1 1 f dd, and thus (2.88) is obtained. For a more 

detailed and rigorous proof of Gauss's theorem, see Kellogg, "Founda- 
tions of Potential Theory. " 

Example 2.32. Let - gr/r s , and let F be a region surrounding the origin with 
boundary surface S. We wish to compute // dd. We cannot apply the diver- 



gence theorem to the region V since V is discontinuous at r 0. We overcome 
this difficulty by surrounding the origin by a small sphere S of radius with center at 
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the origin (see Fig. 2.26). The divergence theorem can be applied to the region V 
(V less interior of S sphere) with boundaries S and S. Thus 

//*+//*-///*(?)* 

S S V 

We have seen that V (</r/r 3 ) = 0, r 7* 0, so that 

//"--//** 

,s z 

Now on S, r ** e, r/d = (-r/r) dS, so that (<?r/r 3 ) dd =* (-g/e 2 ) d& and 



Hence 

// E r/d 47rg 

5 

gr/r 8 is the electrostatic field due to a point charge q at the origin. For a con- 




FKJ 2.26 
tinuous distribution of charge of density p in S it can be shown that 



E' 



P UT 



Assuming that the divergence theorem can be applied (see Kellogg's " Foundations of 
Potential Theory" for proof of this fact), then 



IJI 



p dr 



(2.89) 



Since (2.89) holds for all volumes we must have V E = 4irp, provided V E 4-n-p is 
continuous. In a uniform dielectric medium V D = 4irp, D = *E. For magnetism 
one has V B * 0. The equations V D = 4irp, V B = comprise two of Maxwell's 
equations. It can easily be shown that E = VV for an inverse-square force. In 
empty space p = so that V 2 F 0. A great deal of electrostatic theory deals with 
the solution of Laplace's equation, V 2 F * 0. 
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Example 2.33. Green's Theorem. We apply (2.88) to f i = uVv and f 2 t>Vw and 
obtain 



dr - HI (uV*V + Vw V0) dr = /I uVv dd 



/// V (Wu) dr = JNhj (t>V 2 u + Vt; Vw) dr JJvVu-fa 



(2.90) 



Subtracting, we obtain 

= // (uVt> - t;Vw) dd (2.91) 



f 



Equations (2.90) and (2.91) are Green's formulas. 

Example 2.34. A Uniqueness Theorem. Let ^> and ^ satisfy Laplace's equation 
inside a region R, and let <p = ^ on the boundary S of #. We show that <f> = ^. Let 
= <p - $. Hence V 2 = W - VV =" in #, and s on 5. Applying (2.90) 
with w t; = yields 



([ 



eve - 



i 



so that /// (ve - V0) dr = 0. Since V0 is continuous (V 2 is assumed to exist), we 



have ve = in R and B = constant. Thus <p \j/ = constant. Assuming ^> ^ is 
continuous as we approach the boundary, we must have <? = ^, since ^> = ^ on the 
boundary. 

Example 2.35. Let f = /(x, t/, 2) a, where a is any constant vector. Applying (2.88) 
yields 



fdd = HI V (/a) dr - a JJJ 
ff fd " ~ Iff Vfdr 



Hence 

We leave it to the reader to show that 

II dd * f - /77 (V * f ) dr (2.92) 

Example 2.36. A vector field f whose flux I up f dd J over every closed surface 

vanishes is called a solenoidal field. From (2.88) it follows that V f 0. Now 
assume f = V X g, so that V f = V (V X g) = 0. Thus the curl of a vector is a 
solenoidal vector. Is the converse true? If V f 0, can we write f =* V X g? 
The answer is "yes," and we call g the vector potential of f. Notice that g cannot be 
unique for V X (g -f v>) V X g. We now exhibit a method for determining g. 
Let f Xi -f Yj -f Zk, and assume g = ai -f 0j + 7k. We wish to determine g so 
that V X g f provided V f 0. Thus a, ft 7 must satisfy 



78 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

dy __ _ % 

dy dz ** 

|2 - I? _ y (2.93) 

dz dx 



We are to determine a, 0, 7. Assume a * 0. Then 

y _! x 

dy dz 



Of necessity, 

0(x, y, z) - /J (x, y, z) dx + <r(y, z) 

r(x, y, z) - / I 7 (a:, y, x) dx -f r(y, z} 

J XO 

dy a/J /"* /aF . dZ\ , . dr d<r 

Hence -- 



/ x dX , , dr do 
-r- dx -\ 
;o dx dy dz 

since V f 0. Therefore 



We need only choose <r and r so that " A r (xo, y. 2). Let <r = and 

dy dz 

fv 
r(y, z) - / X(aj , y, z) dt/ 

y^o 

x* and j/o are constants of integration. Hence 

g - j 1^ Z(x, y, z) dx + k [r( ?/ , ^) - |J F(x, |/, z) dx J + V* (2.94) 

where r(y, z) is defined above and <f> is arbitrary. 

For example, let f * x 2 i xyj xzk so that V f * 0. Choosing xo yo * 

yields r(y, z) - ["tidy - 0, f * Z dx - -z f *x dx - ~(zxV2), 
yO ya;o 7o 



and g - (zW2)j -h (t/x 2 /2)k + Vy>. It is to verify that f - V X g. 

Example 2.37. Integration of Laplace's Equation. Let S be the surface of a region 
R for which VV " 0- Let P be any point of R, and let r be the distance from P to 
any point Q in R or on 5. We make use of Green's formula 

dr 
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We choose ^ = 1/r, which yields a discontinuity inside R, namely, 
at P, where r = 0. In order to overcome this difficulty, we proceed as 
in Example 2.32. Surround P by a sphere S of radius e. Using the 
fact that VV = VY = in R 1 (R less the S sphere) yields 



We leave it to the reader to show that, as e - 0, jf (l/r)Vp dd > 

2 

and // ?V(l/r) dt -> -4r^(P). Hence 



(2 ' 95) 

This formula states that the value of <f> at any point P in 72 is deter- 
dned by the value of v? a 
unit vector normal to S. 



*\ 

mined by the value of v? and V^> N = on the surface S, where N is the 

on 



Problems 

1. If f xi yj + (z* - l)k, find the value of // f dti over the closed surface 

S 
hounded by the planes z 0, z 1, and the cylinder y 2 -f ?/ 2 1. 

2. Show that xi -f t/j/x 2 -f- ^ 2 is solenoidal. 



3. Prove that // dd = if S is a closed surface. 

4. Prove that 1 1 dd X f - III V X f dr. 

5. Show that if I f V*> dr - // ^f dd - /// ^>V f dr. 

6. If w = -g-V X v, v * V X u, show that 

rrr 

u wdr 



V Iff v * dr =-* ff (uXv)'dt 

7. If v = v>, VV *= 0, show that for a closed surface /// v 2 dr // ^>v rfd. 

8. If fi and f 2 are irrotational, show that f i X ft is solenoidal. 

9. Find a vector g such that yzi - zx] 4- (z* -f y 2 )k - V X g. 

10. Find a vector g such that r/r s V X g. 



11. If - pv dd - [ dT for a11 surfaces, show that + V ( P v) - 0. 

12. Find a vector f such that V*f-2z+y-l,VXf-a. 
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CHAPTER 3 

TENSOR ANALYSIS 



3.1. Introduction. In this chapter we wish to generalize the notion 
of a vector. In Chap. 2 the concept of a vector was highly geometric 
since we looked upon a vector as a directed line segment. This spatial 
concept of a vector is easily understood for a space of one, two, or three 
dimensions. To extend the idea of a vector to a space of dimension 
higher than 3 (whatever that may be) becomes rather difficult if we 
hold to the simple idea that a vector is to be a directed line segment. 
To avoid this difficulty, we look for an algebraic viewpoint of a vector. 
This can be done in the following manner: In Euclidean coordinates the 
vector A can be written A = Aii + Azj + A&. We can represent the 
vector A by the number triple (Ai, A%, As) and write A = (Ai, A 2 , A*). 
The unit vectors i, j, k can be represented by the triples (1, 0, 0), (0, 1, 0), 
(0, 0, 1), respectively. We define addition of number triples and multi- 
plication of a number triple by a real number a as follows: 

(Ai, A,, A,) + (Bi, B 2 , B,) = (A l + Bi, A 2 + B,, A* + B,) ( . 
a(Ai, A,, A,) - (aAi, aA 2 , aA 3 ) ^ ' 



Equations (3.1) define a linear vector space. We note that 

(Ai, A,, A 8 ) = Ai(l, 0, 0) + A,(0, 1, 0) + A,(0, 0, 1) (3.2) 

The elements AI, A 2, As of A are called the components of the number 
triple. 

Throughout this chapter we shall use the summation convention, and 
at times, for convenience, the superscript convention, of Chap. 1. To 
continue our discussion of vectors, let A = (Ai, A 2 , A 8 ), B = (B 1 , B 2 , B 8 ) 
be two vectors represented as triples. We define the scalar, or inner, 
product of A and B by 

A B = A a B (3.3) 

The square of the norm (or length) of the vector A is defined to be 
L 2 = A a A, Ai = A\ i = 1, 2, 3. If L 2 = 1, A is a unit vector. The 
cosine of the angle between two vectors is defined by 

81 
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(3 ' 4) 



It is not difficult to show that |cos 0| ^ 1. We must show that 

A^BjB* ^ (A a BY (3.5) 

Let us consider 

3 ^ 

y = (A a x - )> (3.6) 



for x real. 

Now y = ^l^z 2 2A a JS a o: + BjB J represents a parabola. Since y ^ 
for all real x, y = has no real roots or two equal real roots. Hence 
(3.5), the Schwarz-Cauchy inequality, holds. Let the reader show that, 
if the equality sign holds in (3.5), then A a = \B a , a = 1, 2, 3. If 
X > 0, cos 6 = 1. If X < 0, cos = 1. If cos 6 = 0, that is, 

A a B = 

we say that A and B are orthogonal. 

We can define the vector, or cross, product of A and B algebraically 
as follows: Let A\ B\ i = 1, 2, 3, be the components of A and B, respec- 
tively. Let Ck, k = 1, 2, 3, be the components of the number triple 
C = (Ci, C 2 , C 8 ), where 

C k = t, 3k A'B> (3.7) 



The epsilons of (3.7) were defined in Sec. 1.2. We note that 



C\ = A*B l - 

C 8 = A 1 B* - A*B> 

Let us consider the scalar product of A and C. We have 
A-C = A k C k = <- lJk A*A k B> 



But k ji = -;* so that A C = 0. Similarly B C = 0. 
If A = (Ai(t), A 2 (0, -Ai(O), then is defined to be the 



T^ 1J TT T^ y T' 

* ; / * ; * * ; 



-rr> -rr ) If ^(^S 3 2 ^ 8 ) is a scalar function of . 
at at / 

the gradient of <f> is defined to be the triple 

l,A A,* \ 

(3.9) 
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It is easy to define the divergence and curl of a vector in terms of 
number triples. Let the reader show that 



yields the conventional V X A. The divergence of A is the scalar 

A A 

V'A = ^ a (3.11) 

The definitions above refer to Euclidean coordinates. 

In the above presentation geometry has been omitted. Everything 
depends on the rules for manipulating the number triples. One need 
only define an n-dimensional vector as an n-tuple 

A = (A l9 A,, A,, . . . , A n ) (3.12) 

The definitions for manipulating triples are easily extended to the case 
of n-tuples. There is some difficulty in connection with the vector 
product and the curl. This will be discussed later. 

Let us jhope that the reader does not feel that it is absolutely necessary 
to visualize a vector in a four-dimensional space in order to speak of such 
a vector. He may feel that an abstract idea can have no place in the 
realm of science. This is not the case. No one can visualize a four- 
dimensional space. Yet the general theory of relativity is essentially 
a theory of a fbur-dimensional Riemannian geometry. 

One further generalization before we take up tensor formalism. Let S 
be a deformable body^tfhich is in a given state of rest. Let P(x 1 9 x z , # 8 ) 
be any point of this body. Now assume that the body is displaced from 
its position to a new position. Let 8i(x l , z 2 , x 8 ), i = 1, 2, 3, represent the 
displacement vector of the point P. The displacement vector at a nearby 
point Q(x 1 + dx 1 , re 2 + dx\ z 8 + rfx 8 ) is 



*,*')+ d* (3.13) 

except for infinitesimals of higher order. We can write 

2*. = I (*** + M + 1 (* _ M n i-n 

d* ~ 2 \to + 5/ + 2 Vto a?/ ( ; 

The nine terms of 5 f T~ . + T~ V i, j = 1, 2, 3, are highly important in 

deformation theory. It is convenient to represent these nine terms as 
the elements of a 3 by 3 matrix. A simple generalization tells us that 
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we may wish to speak of a large collection of elements represented by 

r*J'" k ( r l r 2 r n\ 

db"-c\X >>>*'/ /g j\ 

t,j, . . . , fc, a, 6, . . . , c = 1, 2, 3, . . . , n 

This is our first introduction to tensors. Before we speak of ten- 
sors, we must say a word about coordinate systems and coordinate 
transformations. 

The totality of n-tuples (x 1 , x 2 , . . . , x n ) forms the arithmetic n-space, 
the x* real, i = 1, 2, . . . , n. By a space of n dimensions we mean any 
set of objects which can be put into one-to-one correspondence with the 
arithmetic n-space. Thus 

P <-> (x 1 , x 2 , . . . , x) (3.16) 

The correspondence (3.16) attaches an n-tuple to each point P of our 
space. We look upon (3.16) as a coordinate system imposed on the 
space of elements P. We now consider the n equations 

y* = yK* 1 , **, , * w ) t = 1, 2, . . . , n (3.17) 

and assume that we can solve for the x l so that 

(3.18) 



We assume that (3J7) and (3.18) are single- valued. The reader may 
read that excellent text, "Mathematical Analysis/ 7 by Goursat-Hedrick 
on the conditions imposed upon (3.17) in order that (3.18) exists. The 
n-space of which P is a point can also be put into one-to-one correspond- 
ence with the set of n-tuples of the form (y l , y 2 , . . . , y n \ so that a new 
coordinate system has been imposed on our n-space. The point P has 
not changed, but we have a new method for attaching coordinates to the 

elements P of our n-space. It is for this reason that (3.17) is called a 

transformation of coordinates. 

3.2. Contravariant Vectors. We consider the arithmetic n space and 

define a space curve, F, fri this V n by 

r i __ r t(A 1 O n 

p. x x\i) i i, ^, . . . , n ,g g. 

In a 3-space the components of a vector tangent to the space curve are 
~W ~HT' ~W Generalizing, we define a tangent vector to the space curve 

(3.19) as the n-tuple f -^-, -, - - , ^Y written 

dx i 

-57 i - 1, 2, . . . , n (3.20) 
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The elements of (3.20) are the components of a tangent vector to the space 
curve (3.19). Now let us consider an allowable (one-to-one and single- 
valued) coordinate transformation of the type given by (3.17). We 
immediately see that 

y^ = y i (x ^ x , . . . , x") = y^(t\ x*(t\ . . . , a?(0] = tf(t) (3.21) 

i = 1, 2, . . . , n. Equation (3.21) represents the space curve F as 
described by the y coordinate system. An observer using the y coordi- 
nate system will say that the components of the tangent vector to F are 
given by 

^ t = 1, 2, . . . , n (3.22) 

Needless to say, the x coordinate system describing F is no more 
important than the y coordinate system used to describe the same curve. 
Remember that the points of P have not changed : only the description of 

these points has changed. We cannot say that ( j-> -^ > -%- J 

/ /77/1 (JlJ (HlJ \ 

is the tangent vector any more than we can say that I r > -|-> > - ) 

\ at at (it / 

is the tangent vector. If we were to consider all allowable coordinate 
transformations, we would obtain the whole class of tangent elements, 
each element claiming to be a tangent vector for that particular coordinate 
system. It is the abstract collection of all these elements which is said 
to be the tangent vector. We now ask what relationship exists between 
the components of the tangent vector in the x coordinate system and the 
components of the tangent vector in the y coordinate system. We easily 
answer this question since 

dj_wdx^ 

dt ~ dx dt * - 1, 4 , n (4-M) 

dip dy a 

We note that, in general, -jr depends on every =- since the index a 

is summed from 1 to n. We leave it to the reader to show that 



dt dy dt ' 

We now make the following generalization: The numbers A l (x l , x 2 , 
. . , x n ), i = 1, 2, . . . , n, which transform according to the law 



~ A'(x\ x*, . . . , *) i=l,2, ... ,n 

(3.25) 
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under the coordinate transformation 

x l = &(x l , x\ . . . , x n ) i = 1, 2, . . . , n (3.26) 

are said to be the components of a contravariant vector. The contra- 
variant vector is not just the set of components in one coordinate system 
but is rather the abstract quantity which is represented in each coordinate 
system x by the set of components A'(x). 

One can manufacture as many contravariant vector fields as one desires. 
Let (A l (x), A 2 (x), . . . , A n (x)) be any n-tuple in an 

x = (x*, :r 2 , . . . , rr) 

coordinate system. In any x coordinate system related to the x coordi- 
nate system by (3.26), define the A 1 , i = 1, 2, . . . , n, as in (3.25). 
We have constructed a contravariant vector field by this device. If the 
components of a contravariant vector are known in one coordinate system, 
then the components are known in all other allowable coordinate systems, 
by (3.25). A coordinate transformation does not yield a new vector; it 
merely changes the components of the same vector. We say that a 
contravariant vector is an invariant under a coordinate transformation. 
An object of any sort which is not changed by transformations of coordi- 
nates is called an invariant. If the reader is confused, let him remember 
that a point is an invariant. The point does not change under a coordi- 
nate transformation; the description of the point changes. 

The law of transformation for a contravariant vector is transitive. 
Let 



- r 

Then ^- 

which proves our statement. 

Example 3.1. Let X, Y t Z be the components of a contravariant vector in a 
Euclidean space for which ds 2 = dx 2 + dy 2 + dz 2 . The components of this vector 
in a cylindrical coordinate system af e 



i 

dx dy dz 

where r (x 2 + y 2 )*, 6 tan" 1 (y/x), z z. Notice that the dimension of is not 
the same as the dimensions of X, F, and Z. The quantity r9 has the correct dimen- 
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sions. (Rj rO, Z) are the physical components of the vector as distinguished from the 
vector components (R, 0, Z). R, rO, and Z are the projections of the vector 

A - Xi + Fj + Zk 

on the unit vectors e r , e0, e t = k, respectively. 

Problems 

1. Show that, if the components of a contravariant vector vanish in one coordinate 
system, they vanish in all coordinate systems. 

2. If A 1 and B* are contravariant vector fields (the A* and B l , i 1, 2, . . . , n, are 
really the components of the vector fields), show that O A* B* is also a contra- 
variant vector field. 

3. What can be said of two contravariant vectors whose components are equal in one 
coordinate system? 

4. If X, Y, Z are the components given in Example 3.1, find the components in a 
spherical coordinate system. By what must 6 and 4 be multiplied to yield the 
physical components? 

5. Let A 1 and B* be the components of two contravariant vector fields. Define 
C*'(x) - A*(x)B>(x), C(x) = A*(x)B'(x). Show that 



6. Referring to Prob. 5, show that 



3.3. Covariant Vectors. We consider the scalar point function 

<p(x l , x 2 , . . . , x n ), which is assumed to be an absolute invariant in that 
under the allowable coordinate transformation x 1 = &(x l , x 2 , . . . , x w ), 
i = 1, 2, . . . , n, 



(3.27) 

where x l (x) = x l (x l , x 2 , . . . , x n ), t = 1, 2, . . . , n, is the inverse 
transformation of (3.26). We form the n-tuple 



d ^ d ^ d <? \ 

1 ' a?' ' ' ' J a?y 



/o ofi^ 

(3 ' 28) 

which is an obvious generalization of the gradient of <p. Differentiating 
(3.27) yields 



Equation (3.29) relates the components of grad <p in the x coordinate 
system with the components of grad (# = <p) in the 5 coordinate system. 
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More generally, the numbers Ai(x l , x 2 , . . . , x n ), i = 1, 2, . . . , n 
which transform according to the law 

dx a 
A*(x\ V, . . . , S) = A a (x\ z 2 , . . . , x) (3 3Q) 

i = 1, 2, . . . , n 

are said to be the components of a covariant vector field. The remarks 
of Sec. 3.2 concerning contravariant vectors apply here as well. One may 
ask what the difference is between a covariant and a contravariant vector. 
It is the law of transformation. Compare (3.25) with (3.30)! The 
reason that no such distinction was made in Chap. 2 will be answered in 
Sec. 3.7. 

3.4. Scalar Product of Two Vectors. Let A l (x) and B t (x) be the com- 
ponents of a contravariant vector and a covariant vector. We consider 
the sum A a B a . What is the form of A a B a in the x coordinate system? 
The letter x is an abbreviation for the set Or 1 , x 2 , . . . , x n ). Now 



B a 

7* dX a i 



so that A a B a 



= A"B a 

Hence the form of A a B a remains invariant under a coordinate transforma- 
tion. This scalar invariant is called the scalar, dot, or inner product 
of the two vectors A and B. 

Problems 

rf 7 A dX a , A , A . r dX a 

1. If A t = A a -rr- show that A t A a r - 

dx l dx l 

2. If <p and ^ are scalar invariants show that 



grad (^) * <p grad ^ -f ^ grad ^ 
grad F(<?) = F'(^) grad <f> 

3. If At and B are the components of two covariant vector fields, show that 
C t; = A t Bj transforms according to the law 



4. Show that CJ = A*5, transforms according to the law (?j() == CJ(x) r-^- 

A* and B are the components of a contravariant vector and covariant vector, respec- 
tively. 
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6. Assume A l (x)B v (x) A l (x)B t (x) for all contravariant vector fields A*. 
that B t is a covariant vector field. 

Sa^pr Show that 
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Show 



6. Let 



3.5. Tensors. The contravariant and covariant vectors defined above 
are special cases of differential invariants called tensors. The com- 
ponents of the tensor are of the form 



where the indices ai, a 2 , . . . , a r , frl, 62, . . . , b, run through the values 
1, 2, . . . , n and the components transform according to the rule 



dx* 



(3.32) 



The exponent A/" of the Jacobian 



dx 
'dx 



is called the weight of the tensor. 



If N = 0, we say that the tensor field is absolute; otherwise the tensor 
field is relative of weight AT. For N = 1 we have a tensor density. 
The tensor of (3.32) is said to be contravariant of order r and covariant 
of order s. If s = (no subscripts), the tensor is purely contravariant, 
and if r = (no superscripts), the tensor is purely covariant; otherwise 
we have a mixed tensor. The vectors of Sees. 3.2 and 3.3 are absolute 
tensors. If no indices occur, we are speaking of a scalar. 

At times we shall call T^l'.'.'.^(x) a tensor, although strictly speaking 
the various T'& are the components of the tensor in the x coordinate 
system. 

Two tensors are said to be of the same kind if the tensors have the 
same number of covariant indices, the same number of contravariant 
indices, and the same weight. Let the reader show that the sum of two 
tensors of the same kind is again a tensor of the same kind. 

We can construct further tensors as follows : 

(a) The sum and difference of two tensors of the same kind are again 
a tensor of the same kind. 

(b) The product of two tensors is a tensor. We show this for a special 
case. Let 



so that 



J 




dx 


2 Ta dx? dx* 


J. J 




dx 


3 dx> dx" 


5" 


= 


dx 
dx 


* dx k dx 1 
dx dx r 


(f}S kl ) = 


dx 
dx 


6 (T*W dX * d * t d * hd * 1 
( * ' dx>' dx dx* dx T 
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(c) Contraction. Let us consider the absolute mixed tensor A}. We 
have 

^) = ^(*)gg (3-33) 

The elements of A] may be looked upon as the elements of a matrix. 
Let us assume that we are interested in the sum of the diagonal terms, 
A\. In (3.33) let j = i, and sum. We obtain 



= A(x) = A\(x) 

Hence A\(x) is a scalar invariant (invariant in both form and numerical 
alue). From the mixed tensor we obtained a scalar by equating a super- 
3ript to a subscript and summing. Let us extend this result to the 

bsolute tensor A^ lm . We have 



Iv = A . 

klm pffT dx a dx* dx k dx l dx m 



Now let / = j, and sum. We obtain 



^ 
^ Q_ k 



= A a* x x x 

~ pffT ak ^ 



pffT dx~" dx 1 

A e ^!^!^! 
pffr dx dx k dx m 



= 

ftOT dx" dx k dx m 

so that A T are the elements of an absolute mixed tensor. We may 
write B a pr == A T since the index o- is summed from 1 to n and hence 
disappears. Notice that the contravariant and covariant orders of B a pr 
are one less than those of Aj$ m . It is not difficult to see why this method 
of producing a new tensor from a given one is called the method of 
contraction. 

(d) Quotient Law. One may wish to show that the elements B/(z) 
are the elements of a tensor. Assume that it is known that A 4 Bj k is a 
tensor for arbitrary contravariant vectors A\ Then 



dx k 
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SO 



that A i ( Bjk Bp y j- 1 = 0. Since A* is arbitrary, we must have 



This is the desired result. If A*Bi..'. is a tensor for arbitrary A*, then 
fijb .'.'.' is a tensor. This is the quotient law. 

Excynple 3.2, The Kronecker delta 6] (see Sec. 1.1) is a mixed absolute tensor, for 



t ax' ox** __ ox* ox^ ax* ^ - 
> dx" Jx 7 ~ ~d dx* " d " a 



If ^(a: 1 , a; 2 , . . . , x n ) = ^(ar) is an absolute scalar, tf>(x) = $(x), then ^55 is an 
absolute mixed tensor whose components in the x coordinate system equal the corre- 
sponding components in the x coordinate system. Conversely, let Aj(x) be an 
isotropic tensor, that is, A](x(x}} = A](x). Theix 

. , . _ dx 1 dx" 



fix" dx> 



Since - a can be chosen arbitrarily, it follows that 

uX 



for all i, j, a, <r. Choosing i 5^ j and a a, it follows that A] 
choose ?' = j, a <r, no summation intended. Then A 
occurring. Hence the diagonal elements of A] are equal to 
A* = ^>5}. Let the reader show that if A^ t is isotropic then ^ 




f. Now 

nmation 
^o that 



Example 3.3. Let ^,(x) be the components of an abglmite covanant tensor so that 



and 



If gr a<r = g ffai we have, upon taking determinants, that 



and 



Now if A T are the components of an absolute contravariant vector, then A 1 ** A a 



and 



fix" 



da: 



dx_ 
dx 
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Thus B i is a vector density. This method affords a means of changing absolute 
tensors into relative tensors. 

Example 3.4. Assume g a & dx a dx& is an absolute scalar invariant, that is, 

g a $ dx dx* = g a dx a dx? 
dX a 

Moreover we assume that g a p g$a. From dx a - d& we have 

Gap 1~ !^ dx dx - g v dx dx* 

for arbitrary da;'*. It follows from Example 1.2, that 



Hence the 0a/s(z) are the components of an absolute co variant tensor of rank 2 (two 
subscripts). 

Example 3.5. Let Ai and # t be absolute covariant vectors. Let the reader show 
that 



is an absolute covariant tensor of rank 2 and that C t j = C/. For a three-dimen- 
sional space 

A iB'2 - 

2 _ A 2 Bi) 





The nonvanishing terms correspond to the components of the vector product. 

Problems 

1. If the components of a tensor are zero in one coordinate system, show that the 
components are zero in all coordinate systems. 

dx a dx& 

2. If 0*0 - gp a and g t) = g a p > show that $ t , = g jt . 

3. From g a & = ?(ga0 -f 00) H- ?(gafi - gfa), show that 

gap dx a dxP - j(g a p + gp a ) dx dx* 



4. If Ay t is an absolute tensor, show that A\\ is an absolute scalar. 
6. If A a p is an absolute tensor, and if A a ^Ap y 6", show that A"P is an absolute 
tensor. The two tensors are said to be reciprocal. 

6. Show that the cof actors of the determinant |o,| are the components of a relative 
tensor of weight 2 if o u - is an absolute covariant tensor. 

7. If A* is an absolute contravariant vector, show that r-j is not a mixed tensor. 

8. Assume A]/ t is an isotropic tensor. Show that (3.34) holds. 

dx 



9. Use (1.26) and |^,l - I0.il 



dx 



to show that |0,/|t ci,k is an absolute tensor. 
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3.6. The Line Element. For the Euclidean space of three dimensions 
we have 

<fe 2 = dx z + dy* + dz* (3.35) 

The simple generalization to a Euclidean n-space yields 



ds* = (dx 1 ) 2 + (dx 2 ) 2 + + (dx n ) 2 

= b a ft dx a dx? 5 a/3 = if a ^ 0; S^ = 1 if a = |8 (3.36) 

If we apply a coordinate transformation, x i = x*(x l , # 2 , . . . , n ), 

dx a dx& 

i = 1, 2, . . . , n, we have dz<* = - d&, dx* = - d$ v , so that (3.36) 

uX uX 

takes the form 



= (7 M , dx" dx" (3.37) 

3 

5x a dx^ \ A Qx a dx a 
where ^ Mv = 60 ^ ^ = y ^ ^' Riemann considered the general 

a = l 

quadratic form 

ds* = gafi(x) dx" dx* (3.38) 

This quadratic form (the line element ds 2 ) is called a Riemannian 
metric. The g a $(x) are the components of the metric tensor (see Exam- 
ple 3.4). A space characterized by the metric (3.38) is called a Rie- 
mannian space. 

Theorems regarding this Riemannian space yield a Riemannian geom- 
etry. Given the form (3.38), it does not follow that a coordinate trans- 
formation exists such that ds* = d a p dy a dy&. If there is a coordinate 
transformation x % = x l (y l , y 2 , . . . , y n ), such that ds z = d a pdy a dyP, we 
say that the Riemannian space is Euclidean. The y coordinate system is 
said to be a Euclidean coordinate system. Any coordinate system for 
which the g t} are constants is called a cartesian coordinate system (after 
Descartes). 

We can choose the metric tensor symmetric, for 



and Tj(Qij ~ 9ji) dx 1 dx 3 = 0. The terms ^(g l} + g i% ) are symmetric. We 
assume that the quadratic form is positive-definite (see Sec. 1.6). 

Example 3.6. In a tl^ree-dimensional Euclidean space using Euclidean coordinates 
one has 

- (dx 1 ) 2 -f (dx 2 ) 2 4- (dx 8 ) 2 
100 



so that \\ga\\ - 



010 
001 
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For spherical coordinates 

x l r sin cos <? y l sin y* cos y* 
a; 2 = r sin sin <p y 1 sin g/ 2 sin y l 
x l = r cos 6 = y l cos 2/ 2 

The ^ t -, of the spherical coordinates system can be obtained from 

Ai*a AI& 

d(y l , y*, 2/ 8 ) 
/ a. 

Hence d\\ - 



We obtain 




ain ?/ 2 ) 2 
for ? ?^ 



- (di/i) 2 4- 

- dr 2 -f r 2 



^(dy^ + to 1 sin 7/ 2 ) 2 (%) 2 
-f r 2 sin 2 



This spherical coordinate system is not cartesian since #22 5^ constant. 

Example 3.7. We define $7*'' as the reciprocal tensor to g %1 , that is, g ll gjk 5j (see 
Prob. 5, Sec. 3.5). The g* } are the signed minors of the g n divided by the determinant 
of the gtj. The g lj are the elements of the inverse matrix of the matrix \\g %J ||. For the 
spherical coordinates of Example 3.6 we have 



1 







1 





r 2 







1 









II0"II - 


r 2 







r 2 sin 2 8 







] 


r 2 sin 2 



Example 3.8. We define the length L of a vector A* in a Riemarmian space by the 
quadratic form 

L 2 = gc<0AA 8 (3.39) 

The associated vector of A* is the covariant vector 



It is easily seen that A' 0*0 A0 so that 

L 2 - a/ jAA* 
The cosine of the angle between two vectors A*, B* is defined by 



Let the reader show that |cos 0\ 1. 

Problems 

1. Prove (3.40). 

2. Show that |cos 9\ 1, cos e defined by (3.41). 



(3.40) 



(3.41) 
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3. For paraboloidal coordinates 

X l .. ylyt C0 g yl 

X* yly* S in y 

X 8 . l. [(y l), _ (,)] 

Show that 



4. Consider the hypersurface x* &(u l t w 2 ) embedded in a Riemannian 3-space. 
If we keep u l fixed, u l wj, we obtain the space curve x l x(wj, w 2 ), called the w 2 
curve. Similarly a;* = x i (u l 1 ul) represents a u\ curve on this surface. These curves 
are called the coordinate curves of the surface. Show that the metric for the surface 

is ds z h%t du* du j ', where hit g a & T~* ^~r* Show that the coordinate curves 

OU QU 1 

intersect orthogonally if hu = 0. 
6. The equation p(x l , x*, . , . , a^) = determines a hypersurface of a V n . Show 

that the -~ are the components of a covariant vector normal to the surface. 

oX 

6. Show that -r- is a unit tangent vector to the space curve x t (s) ) d 2 g a p dx a dxP. 

7. Show that the unit vectors tangent to the u l and w 2 curves of Prob. 4 are given 

, , i d* , dx* 

by An"* ^22~* r 

dw 1 dW 2 

8. If ^ is the angle between the coordinate curves of Prob. 4, show that 

cos 6 



3.7. Geodesies in a Riemannian Space. If a space curve in a Rie- 
mannian space is given by x { = z*(), we can compute the distance 
between two points of the curve by the formula 



The geodesic is defined as that particular curve x*(t) joining x^to) and 
x^ti) which extremalizes (3.42). The problem of determining the 
geodesic reduces to a problem in the calculus of variations. We apply 
the Euler-Lagrange formula to (3.42) (see Sec. 7.6). The differential 
equation of Euler-Lagrange is 



where / = (g^&&)*. Now 



d^ ~ 2/ dx i 
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If instead of using t as a parameter we switch to arc length s, then t = s 
and/ = 1. Hence 

df _ 1 

dx* ~ 2 



so that (using the fact that gy = ##) (3.43) becomes 



Multiplying by g and summing on i yields 

dV -I- 4 ( d0a{ + *&>- d9a/> } dx " dx<> - H 441 

d? + ** \d? + da" to*/ W dF ~ (d>44j 

d 2 x r , _ dx" da^ _ , rt 

or d^ + r ^ -rf7 dF " T = 1, 2, . . . , n 

(Xo 060 U'O 

where r S ,.^ + _ (3.44') 



Equations (3.44) are the second-order differential equations of the geo- 
desies or paths. The functions F^ are called the Christoffel symbols of 
the second kind. 

Example 3.9. For a Euclidean space using cartesian coordinates we have g %J = 
constant so that -^ = and (3.44') yields r^ = 0. Hence the geodesies are given by 



-ry = 0, or X T = a T s + b T , linear paths. 

Example 3.10. Assume that we live on the surface of a right helicoid immersed in a 
Euclidean 3-space, ds* = (dx 1 ) 2 + [(x 1 ) 2 + c 2 ](dz 2 ) 2 . We have 



Applying (3.44'), we have 



Since g 21 unless i 2, we have 



Similarly, 



nl f\ fit ^ f\ Til -pi __ f\ fil _j -p2 __ /\ ip2 __ -p2 

1 11 Ui r n ** v, r 12 r 2 i o, r 22 -x x , r 22 u, r 12 1 2l r , n2 , 
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The differential equations of the geodesies are 



2*1 dxldx 2 



ds z ^ (x 1 ) 2 4- 
Integrating the second of these equations yields 

-j- [(x 1 ) 2 + c 2 ] - constant = A 

Let the reader show that ( r~ ) 4- 7 rv^ ; ; constant. 
\ as / (x 1 ) 2 + c 2 

Problems 

1. Derive the Tg T of Example 3.10 

2. Find the differential equations of the geodesies for the line element 



dx l dx* 

Integrate these equations with initial conditions x 1 = On, x 2 <po, ~T- 1, -5 at 

8 = 0. 

3. Show that Y r a8 = rj fl . 

4. Obtain the Christoffel symbols and the differential equations of the geodesies for 
the surface 

x 1 = u l cos u z 
x 2 = u l sin u z 
x 3 = 

The surface is the plane x 8 = 0, and the coordinates are polar coordinates 



5. From (3.44') show that 



6. Let ds 2 - i7 dw 2 + 2F rfw dv + G dv*. Calculate |^,|, ||^ ||, r; fc . 

7. If f},(5) = rj 7 (x) g g ^ + Jg-, g, show that rj 7 - T ? , is a tensor. 

8. Obtain the Christoffel symbols for a Euclidean space using cylindrical and 
spherical coordinates. 

9. The Christoffel symbols of the first kind are (t, jk] = g^jk' Show that 
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3.8. Law of Transformation for the Christoffel Symbols. Let the 

equations of the geodesies be given by 

d*x % , _ . , N dx 3 dx k ~ /0 . CN 

w + T * {x} Ts dF - (3 ' 45) 

j <**# i r, ,-,d2'dx k . , .... 

and ^ + !%(*) Tg -^ = (3.4(0 

for the two coordinate systems x\ x l in a Riemannian space. We now 

find a relationship between the rj* and the T] k . From x = x*(x) we have 



= 
ds ~ a^ a ds ds 2 " dx^ dx a ds ds dx a ds 2 

Substituting into (3.46) yields 

^x* dV 6 2 ff dx^ dx* - d| d^ dx^ dx^ = 
aa: a ds 2 ds* dx ds ds + ?fc d^ a dxf ds ds 

dx ff 
Multiply this equation by t and one obtains (after summing on i) 



- = 

3k dx dx<* d# + dx? dx" dx>) ds ds 



Comparing with (3.45), we have 



Equation (3.47) is the law of transformation for the Christoffel symbols. 
Note that the Tj k are not the components of a mixed tensor so that T) k (x) 
may be zero in one coordinate system but not in all coordinate systems. 
Let the reader show that 



Example 3.11. From Prob. 4, Sec. 1.2, and the definition of the tf* we have 



From Prob. 5, Sec. 3.7, 

TjjjT s 

so that ~^! 



Hence ft (3.49) 
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Example 3.12. Let us consider a Euclidean space using Euclidean coordinates. In 
this coordinate system T] k (x) 0. From (3.48) 

P ,*x . d ** ^ 

T > k(x} k ~ 



for the x coordinate system. If the T] k also vanish, then ^ of necessity, so 

that 

x ff - aj -f 6" 

where a*, b* are constants of integration. Hence the coordinate transformation be- 
tween two cartesian coordinate systems is linear. If the transformation is orthogonal, 

n 

y >? - &* 

a= 1 

[see (1.53)]. For orthogonal transformations g* } = g a p -r-r; -rr- reduces to 

CrX oX 



., . . d' dx" 

80that 6 " = ^- 



Now let us compare the laws of transformation for oovanant and contravariant 
vectors. We have 



Making use of (3.50) yields 

n 
\*\ .a r a 

(3.52) 



cr-1 

so that orthogonal transformations affect contravariant vectors m exactly the same 
way covariant vectors are affected. This is why there was no distinction made 
between these two types of vectors in the elementary treatment of vectors. 

Problems 

1. Prove (3.48). 

2. By differentiating the identity g tct g a j = 8) show that 



3. Show that the law of transformation for the Christoffel symbols is transitive. 

4. Derive the law of transformation for the Christoffel symbols from 
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6. Define g* f (x) *e p(x)gi,(x). Show that 



where <p a ~ and r*J, TJ fc are the Christoffel symbols for the {/*,-, #*,, respectively. 

ft uX 

3.9. Covariant Differentiation. Let us differentiate the absolute covar- 
iant vector given by the transformation 



~>- ~'dF 

with respect to an absolute scalar t = t. We obtain 
dAj _ dA dx 4 aV dx k 



i a 
since t = t . It is at once apparent tha,t P-^- Jis not a covariant vector. 

\j GfL I 

We wish to determine a vector (covariant) which will reduce to the ordinary 
derivative in a Euclidean space using Euclidean coordinates. We accom- 
plish this in the following manner by making use of the transformation 

- dx k dx* dx k 

law for the Christoffel symbols: Multiply (3.48) by A l -^ = A M -^-jj- 

to obtain 

. dx k dx dx* dx-* d& d3 k . dxd& d*x* dx k 



= rj Y A a -^- + 
Subtracting from (3.53) yields 



dA dx^ 

Hence --^~ T% y A a -rr is a covariant vector. 

For Euclidean coordinates rj 7 s= so that this covariant vector becomes 

the ordinary derivative rr- We call TT " rj^A a =7- the intrinsic 

ac at MT at 

derivative of A^ with respect to t. Its value depends on the direction 

we choose since it involves ~rr- We write 

at 



. dt " * dt 
Since 
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is a vector for arbitrary -j-> it follows from the quotient law that 

d^*M p a A /o K fi \ 

l^A. ld.W 

is a covariant tensor of rank 2. We call (3.56) the covariant derivative 
of Ap y and we write 



(3<57) 



The comma in A M(T denotes covariant differentiation. 
We now consider a scalar of weight AT, 



A = 



ax 
dx 



We have 



ax* a 

From Prob. 9, Sec. 1.2, we have 

a 



ax 
ax 



ax 



so that = 



dx 3 dx" 



N dx" 



Multiplying r-, = TL y + ; by 

tracting from (3.58) yields 



n "- dx 



ax 

ax 



(3.58) 



A and sub- 






- NAT' = 
J * 



dx 
dx 

dA 



idA _ NATff \ dx 



Hence the invariant (in form), NAT aa , is a covariant vector of 

ox 

weight N. We write 

A, a SE - NAT ff ao (3.59) 

We call A, a the covariant derivative of A. The comma in A, a denotes 
covariant differentiation. If A is an absolute scalar, N = and 



A -- 

A * ~ 



the gradient of A. 
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In general, it can be shown that if T0 l $:::(x) is a relative tensor of 
weight N, then 



M I rpftixt ar-pai t . . _t_ nrr 

I" ^0101 0.1 Mm -T T I 

meriaz atr-p/t motiaj 
1 ,40, . . . ft, 1 im 



1 ft 



:::zr: m (3.60) 

is a relative tensor of weight AT", of covariant rank one greater than 
r** ''' ^'m is called the covariant derivative of n 2 .'.'-X- 
Since the covariant derivative is a tensor, successive covariant differ- 
entiations can be applied. 

Example 3.13. We apply (3.60) to the metric tensor gr,. We have 



from Prob. 5, Sec. 3.7. 

Example 3.14. Curl of a Vector. Let A t be an absolute covariant vector. We 
have 

A "' * IF ~ ^ r? ' and A - S 1 ~ 4 r "' 

^ A ft A 

so that At,, A,, t = r - r r is a tensor. It is called the curl of A and is a 
ox' ox* 

covariant tensor of rank 2. Strictly speaking, the curl is not a vector but a tensor of 
rank 2. In a three-dimensional space, however, the curl may be looked upon as a 

dto 

vector [see (3.10)]. If Ai = T~> then curl At 0. Conversely, if curl At 0, then 
A. - |^ where [see (2.79)] 

<p(x\ x\ . . . , x) - [* Ai(x\ x*, . . . , a?) da: 1 
yxo 1 

rx* 
+ / Ai(sj, a: 2 , . . . , a; w ) dx* 

J ajo* 

A,,(zJ, xj, . . . , xj- 1 , x) dx n (3.61) 



The constants xj, xj, . . . , xj can be chosen arbitrarily. 

Example 3.15. Divergence of a Vector. The divergence of an absolute contra- 
variant vector is defined as the contraction of its covariant derivative. Hence 

div A* A^ -~ + ATJrt 
From (3.49), r a< - "gft, 

j. ... "A-a i i i i A ^ \9 ' 

dlvA = -ZZT + \<?\~* A "-7T^ 

(3.62) 
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If we wish to obtain the divergence of A,, we consider the associated vector A* g* a A . 
The A" of (3.62) are the vector components of A. To keep div A 1 dimensionally cor- 
rect, we replace the vector components of A* by the physical components of A*. For 
spherical coordinates 



101*- 



1 

Or 2 

r 2 sin 2 6 



r 2 sin 



* 



(r 2 sin $A') + ^ (r 2 sin OA') + -~ 
u \_oT ou o<p 



so that div A* 

_ 

ing to physical components (see Prob. 4, Sec. 3.2) 



div A< - 



(rs sin 



(r 8in 



* sm 



1 and chang- 
J 



Example 3.16. The Laplacian of a Scalar Invariant. The Laplacian of the scalar 
invariant <f> is defined as the divergence of the gradient of <f>- We consider the asso- 
ciated vector of the gradient of <f>, namely, g a & -^ Applying (3.62) to g"& ^ yields 
the Laplacian 



Lap <f> 3= V V = j rr - 



(3.63) 



In spherical coordinates 



= r 2 sin 



* 

r 8m '- 

Problems 



1 










1 



r 2 sin 2 



- 



1. Starting with A* A a i show that A*y 

without recourse to (3.60). 

2. Show that (A%),* - A*B ?f * + A; fc B,. 

3. Show that (0rA),, - ^A a r 

4. Show that |gr<,U = 0. 

5. Show that 5*^ - 0. 

6. Find the Laplacian of V in cylindrical coordinates. 



4. 



is a mixed tensor 



7. Show that 



f + rj< AJ - 



for an absolute mixed tensor AJ without 



recourse to (3.60). 

8. Show that A? a - Jj ^ (|^|Uf ) - A|r? tt . 



9. Write out the form of A", (two covariant differentiations). 

10. If A0c. t) is a covariant vector, show that -^ = - + A tf , -37- Show that 

ot at at 

* 

is the acceleration vector if v i is the velocity vector. 



dt 
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3.10. Geodesic Coordinates. Since the Christoffel symbols Tj k are not 
the components of a tensor, it may be possible to find a coordinate system 
in the neighborhood of x i = q i such that T] k (q) = 0. We now show that 
this can be done. Let 



xu x 
so that 



5} and 



(3.64) 



= 1. Hence (3.64) is nonsingular in a 



neighborhood of x = q. The point x l = q l corresponds to the point 
*' = 0, that is, the point x = q now becomes the origin of the x coordi- 
nate system. Differentiating (3.64) yields 



(3.65) 



since T l ^(q) = T^( 
respect to x k yields 



so that 



Now 



dx k d& 



Thus 5 5 = g 



+ 



- r 

- i 



dx? dx 1 * d& 



Differentiating (3.65) with 



dx* 



and evaluating at z l 



0, yields 



- r;() = o. Q.E.D. 



Any system of coordinates for which (T} k ) P = at a point P IB called 
a geodesic coordinate system. In such a system of coordinates the 
covariant derivative, when evaluated at P, becomes the ordinary deriva- 
tive when evaluated at P. For example, 



-^7] + 



snce (r 
system. 



= 0, if the a; 1 are the coordinates of a geodesic coordinate 
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We now show that 



System (3.66) is true in geodesic coordinates at any point P. But if two 
tensors are equal to each other (components are equal) in one coordinate 
system, they are equal to each other in all coordinate systems. Hence 
(3.66) holds for all coordinate systems at any point P. 

Equation (3.64) yields one geodesic coordinate system. There are 
infinitely many such systems since we could have added 



to the right-hand side of (3.64) and still have obtained 1^(0) = 0. 

A special type of geodesic coordinate system is the following: Let us 
consider the family of geodesies passing through the point P, x* = xj. 

dx l 

Each * = -j- determines a unique geodesic passing through P. This 
ds p 

follows since the differential equations of the geodesies are of second order. 
Suitable restrictions (say, analyticity) on the T* k (x) guarantee a unique 
solution of the second-order system of differential equations when the 
initial conditions are proposed. The & are the components of the tan- 
gent vector of the geodesic through P. We now move along the geodesic 
(determined by *) a distance s. This determines a unique point Q. 
Conversely, if Q is near P, there is a unique geodesic passing through P 
and Q which determines a unique * and s. We define 

x* p 8 (3.67) 

The x l are called Riemannian coordinates. A simple example of Rie- 
mannian coordinates occurs in a two-dimensional Euclidean space. 
There is a unique geodesic through the origin with slope m = tan 6. 
The r coordinate of polar coordinates corresponds to the s of Riemannian 
coordinates. Thus ( x = cos 6, 2 = sin 6), r = s, and x l = r cos 0, 
x 2 = r sin 6. In this case the Riemannian coordinate system (x 1 , x 2 ) 
corresponds to the Euclidean coordinate system (x, y). 

The differential equations of the geodesies in Riemannian coordinates 
are 

d 2 x l , - ,_ N dx j dx k 



But = ft = 0, so that 

ds ' ds 2 ' 

= (3.68) 



Since (3.68) holds at the point P (the origin of the Riemannian coordinate 
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system, 2* = 0) for arbitrary ', it follows that 1^(0) = 0. Hence a 
Riemannian coordinate system is a geodesic coordinate system. 

3.11. The Curvature Tensor. We consider the absolute vector V\ 
Its co variant derivative yields the mixed tensor 

7. = _ 4- VT l 
' 3 dx> ^ a3 

On again differentiating covariantly we obtain 

dV l 
V* = >J 4- Vr i , V* T a , 

Y ,1k k ' v ,j l k Y ,a L jk 



Interchanging k arid j and subtracting yields 



where R^, k = ^ - ^ + ^ a] T' ek - T^T' 9] (3.70) 

A necessary and sufficient condition that V\ lk = V\ kl is that R\, k = 0. 
It follows from (3.69) and the quotient law that R l ajk is a tensor. The 
tensor R^ depends only on the metric tensor of the space. It is called 
the curvature tensor. Its name and importance will become apparent 
in the next two paragraphs. The contracted curvature tensor 

(3.71) 

v*>- u*/ 

is called the Ricci tensor and plays a most important role in the general 
theory of relativity. 

The scalar invariant R = g lJ R^ is called the scalar curvature. The 
tensor 

Rh ~ Qh R*' (3 72) 

is called the Riemann-Christoffel, or covariant curvature, tensor. 

Problems 

1. Show that, for Riemannian coordinates, f j k ()&& 0. 

2. Show that (A] + B]), k - A] tk -f B} tk . 

3. Show that JB, - #, t . 

4. Show that R l akjt0 -f- #,* + /%*,, - 0. This is the Bianchi identity. Hint: 
Use geodesic coordinates. 

6. Show that R\\]k Rikjk "^ Rh\ki> -Rj* m 0, K\jkk ** 0. 

6. If Rvj kgij, show that R nk, n the dimension of the space. 
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3.12. Euclidean, or Flat, Space. If a space is Euclidean, there exists 
a coordinate system for which the g^ are constants, so that r] fc (x) 2= 0, 

dr*- 

-T-J = 0. It immediately follows that the curvature tensor Rj kl vanishes 

(all its components are zero). We now show the converse. If the 
curvature tensor is zero, the space is Euclidean. Let us first note that 
if an x coordinate system exists such that the g tj (x) are constants then 
r* fc (x) = 0, and conversely. If there is an x coordinate system for which 
rjtCr) = 0, then 



and conversely, if (3.73) holds, then T* k (x) = 0. Our problem reduces 
to the following: Given the Christoffel symbols T l jk (y) in any y coordinate 
system, can we find a set of x\ i = 1, 2, 3, . . . , n, satisfying (3.73)? 
The system of second-order differential equations (3.73) can be written 

' (3.74) 



We reduce (3.74) to a system of first-order differential equations. Let 

w " " (3 - 75) 

ftn 
so that ^ = u\T] k (y) (3.7(5) 

For each a we have the first-order system of differential equations given 
by (3.75) and (3.76). These equations are special cases of the more 
general system 



(3.77) 



with fc = 1, 2, . . . , n + 1; j = 1, 2, . . . , n. If we let z l = x, 
z * = ul, . . . , z n+1 = u^ (3.77) reduces to (3.75), (3.76). 

If a solution of (3.77) exists, then of necessity (assuming differenti- 
ability and continuity of the second mixed derivatives) 



, = 
dy l dz* dy l dy> dz dy 3 



+- +* 

If the F k j are analytic, it can be shown that the integrability conditions 
(3.78) are also sufficient that (3.77) has a unique solution satisfying the 
initial conditions z k = zj a ^ 2/ r ^ 111- The reader is referred to the 
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elegant proof found in Gaston Darboux, "Legons systemes orthogonaux 
et les coordonn6es curvilignes," pp. 325-336, Gauthier-Villars, Paris, 
1910. 

The integrability conditions (3.78) when referred to (3.75), (3.76) 
become 

T?M = TfouZ . - . 

RW = o (3 ' 79) 

The first equation of (3.79) is satisfied from the symmetry of the T* k , 
and the second is satisfied if Ufa = 0. Hence a necessary and sufficient 
condition that a Riemaimian space be Euclidean is that the curvature 
tensor vanish. 

3.13. Parallel Displacement of a Vector. Consider an absolute con- 
travariant vector A 1 in a Euclidean space. Using cartesian coordinates 
yields T* k (x) = 0. We assume further that the A % are constant. From 



we have 



T IV * t/t L/ 

__ A a , ._ 

Oft dx a dx y 6 
Moreover 

Ty ff = since 
d*x l dx& dx a 



dx? dx dxt dx 
from Example 1.4. Hence 

dA i = - A'T^ dfr (3.80) 

Now the A\ being constant in a Euclidean space, can be looked upon as 
yielding a uniform or parallel vector field. Equation (3.80) describes how 
the components of this parallel vector field change in various coordinate 
systems. Since, generally, a Riemannian space is not Euclidean, we can 
use (3.80) to define parallelism of a vector field. 

We say that A i is parallelly displaced with respect to the Riemannian 
V n along the curve x i = x l (s) if 
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We say that the vector suffers a parallel displacement along the curve. 
If a vector suffers a parallel displacement along all curves, then from 

fj Ai A A i /7/T0 

tt^l u/1 (IX* . - ,, , 

-j = -7T-R -T- it follows that 
as dx* ds 

~dxP = ~~ A T * 
or A^ = 0. 

Example 3.17. Let us consider two unit vectors A 1 , B l which undergo parallel 
displacement along a curve. We have 



, 
and 



cos = 

5 (cos B) dA D , . . dBP 

_L_2 . W _^ +W A 



since g a p,y = 0, r = 0, - = 0. Hence, if two vectors of constant magnitude 

OS uS 

undergo parallel displacements along a curve, they are inclined at a constant angle. 

Two vectors at a point are said to be parallel if their corresponding 
components are proportional. If A 1 is a vector of constant magnitude, 
the vector B l = <pA\ <p = scalar, is parallel to A*. If A* is also parallel 

8A l 
with respect to the V n along the curve x l = x { (s), we have -r = 0. Now 

OS 



ds * ds ds ds 

--%* 

<p as 

We desire B l to be parallel with respect to the V n along the curve. Hence 
a vector of variable magnitude must satisfy an equation of the type 

^ = f(*)B* (3.82) 

if it is to be parallelly displaced along the curve. 

Example 3.18. The curvature tensor arises under the following considerations: 
Let x l * x l (t), ^ t ^ 1, be an infinitesimal closed path. The change in the com- 
ponents of a contravariant vector on being parallelly displaced along this closed 
path is 



If we expand A", r^ in a Taylor series about x\ = x l (Q) and neglect infinitesimals of 
higher order, it can be shown that 



(3.83) 
where R^ y is the curvature tensor of Sees. 3.11, 3.12. 
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Problems 

1. Show that the Riemannian space for which ds* = dO* -\- sin 2 9 d^ is not Eu- 
clidean. 

2. Derive (3.79). 

3. Show that the unit tangent vector to a geodesic suffers a parallel displacement 
along the geodesic. 

4. If B l satisfies (3.82), show by letting #* = ^A* that it is possible to find ^ so 

that ^ - 0. 
8s 

6. Derive (3.83). 

3.14. Lagrange's Equations of Motion. Let L be an absolute scalar 
invariant of the space coordinates (x 1 , .r 2 , . . . , x n ), their time deriva- 
tives (i, #2, . . . , x n ), and the invariant time t, t = I. Hence 

L(x\ x\ . . . , x, x 1 , z 2 , . . . , x*, t) 

under the transformations 

** = ^(x\x 2 , . . . ,x") t = 1,2, . . . ,n 

= f (3 ' 84) 

From (3.84) 



dx l _ dx 4 a^ a _ dx* _ dx l 
6^~o^6^"~aS" J ~dS 

when it is assumed that the x l and fr are independent variables as far 
as L is concerned. Now 

aL = ^Ldx^ dLfrr* 
d* ~ dx a dx l dx a dx* 



dL d*x 

X 



Also ^ = ~ 

,, , d (dL\ d (dL\ 

SO that -jf [ -^r, ] = ^7 I TT; J 

at \ox / at \ox / 



dL 

di, ~ * 



Subtracting (3.85) from (3.86) yields 
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Equation (3.87) implies immediately that 



dt \dx a / dx a 

is an absolute covariant tensor. 

Let us consider a system of n particles, the mass m,, i = 1, 2, . . . , n, 
located at the point x", a = 1, 2, 3. We assume that the coordinates 
are Euclidean and that Newtonian mechanics apply. Let 

V(x l x 2 x* x 1 x 2 x 3 x l j 2 r 8 ) 

be the potential function such that 

* = "" <dx] 

represents the rth component of the force applied to particle s. In 

d z x r 
Newtonian mechanics F r s = m s -~ for Euclidean coordinates. The 

kinetic energy of the system is defined as 



where g a $ = 5 a/ 9. The Lagrangian of the system is defined by 
L = T - V 

xfx , - V 



Thus - = m a g a rX = 



., . d fdL\ dL .. r , dV n 

80 that a\wj -dx] = m * x * + a*. = 

for Newtonian mechanics. Since -^ { T ) -r-: is a vector, it vanishes 

a< \dxj/ dx r 9 

in all coordinate systems. We replace the x r 9 by any system of coordinates 
(j 1 , g 2 , . . . , g n ) which completely specifies the configuration of the 
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system of particles. Lagrange's equations of motion are 

dL 



#\**i ^ ~ " r - 1, 2, .... n (3.88) 

Example 3.19. We consider the motion of a particle acted upon by the force field 
F = vF. We use cylindrical coordinates. 

L - T - V - m(r 2 -f r 2 2 + 2 2 ) - F 
so that = m(rd*) 

d fdL\ 



and one of Lagrange's equations of motion is 

mr mrd* + -r- = 

OT 

Since r- represents the radial force, the term r r0 2 must be the radial accelera- 

OT 

tion. 

If no potential function exists, or if it is difficult to obtain the potential 
function, we can modify Lagrange's equations as follows: Since the 
kinetic energy is a scalar invariant, we have that 

*-*()- ** 

is a covariant vector. Let the reader show that if the x l are Euclidean 
coordinates, then Q r is the Newtonian force. If / r is the force in a y 
coordinate system, then 

Qr = ia & Qr d * r = /a f? dxr = fa dya 

so that Q r dx r is a scalar invariant and represents the differential of work 
dW, and 



We obtain Q+ by allowing x* to vary by an amount Ao:*, keeping x l , # 2 , 
. . . , x 1 " 1 , x*" 1 " 1 , . . . , x n fixed, calculate the work ATT done by the 
forces acting on the system, and compute 



r t . . , 

Qt = lim i not summed 

* 



Example 3.20. A hoop rotates about a vertical diameter with constant angular 
speed w. A bead is free to slide on the wire hoop, and there is no friction between hoop 
and bead. Gravity acts on the bead of mass m. We set up the equations of motion 
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of the bead, using spherical coordinates. The force of the hoop on the bead is given 
by F = Re r + $6^, R, 3> unknown, and the force of gravity is 

G *B mg cos 0e r + mg sin 0e0 
Hence 

Q r R mg cos 
Q e = mgr sin 
Qp = $r sin 

From T = ^m(r 2 + r 2 2 + r 2 sin 2 0< 2 ), we have 

("a 7 " ) = mr 

fdT\ = d 

: \dd / (ft l ; 



f/y if I u*** v if j 

dT . . rf /dT 7 ' 

- = mr 2 sin cos 2 



- - ^ 

d^> d^ \c 

Equation (3.89) yields 

-57; (r^ 2 + r sin 2 v? 2 ) = g cos 
dt 2 m * 

-T (r 2 ^) - r 2 sin cos B <p* = gr sin (3.90) 

4, (r 2 sin 2 tf>) = - r sin 
at m 

The geometry of the configuration yields r constant ro, r = r 0, co f 
<p = 0. The solution of the problem consists in finding 0(0 by integrating the second 
equation of (3.90). Once this is done, R and 4> can be obtained from the other two 
equations. Let the reader show that 

g-rj ( -17 J -g-rjto 2 sin 2 = 0r cos + constant (3.91) 

3.15. Hamilton's Equations of Motion. From the Lagrangian L(x, x, t) 
we define 

Pl = |^ t i = 1, 2, . . . , n (3.92) 

We show that p?, is a co variant vector, called the generalized momentum 
vector. Since L = L, we have 

dL _ dL d:c a _ ^L ax tt 

ax a 



which proves our statement. We shall now use q* instead of x i to repre- 
sent the coordinate system. The Hamiltonian is defined by 

H - p a q" - L(j, (?, (3.93) 
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From pi = ss F(g, g, t) we assume that we can solve for the g 1 , 

i = 1,2, . . . , n, in terms of the p, g t , and time. Thus the Hamiltonian 
H now becomes a function of the p's, g's, arid time, 

H = H(p, q, t) 

__ dH do 01 dL dJu dq 

Hence -p a _-_-_ ^ 

= ~*T 
since T-T- = p. From Lagrange's equations of motion, 

Off* 

rv T 7 / 

dL _ a I c 
~dq % ~~ di\t 

., , dp, dH 

so that -j- = ^-^ 

Also = g* *i~ PO 

dpt ^PI 

d(f~ 

~ ql = ~di 

from (3.92) and (3.93). 

Hamilton's equations of motion are 

d(f = dH 
dt dp T 
d& = _dH 
dt dq l 

Whereas Lagrange's equations of motion are, in general, a system of n 
second-order differential equations, Hamilton's equations are a system 
of 2n first-order differential equations. 

Example 3.21. Referring to Example 3.19, 

dL . p r 

Pr - W - mr q r - r - - 

&L PQ 

Pe = d? * mf *' = mr" 2 



9L . . p t 

p e ~- =mz **=*=n 



and # - pg - L 



dr _ dH _ P f 


dpr _ 


dH 


dt dp r m 


dt 


dr 


de _ pe 


dpe 


dV 


dt mr* 


dt 


de 


dz _ pz 


dp. 


dV 


dt ~ m 


dt 


dz 
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Hamilton's equations are 



mr* dr 



__ d*r p} dV Sde\z dV 

Hence m = = mr [ I 

r// 2 wr 3 dr \dt / dr 

m (I / 2 d0\ _ ] dV 

r dt\ r dt) ~ ~ rlie 

d z z _ dV 

>n ~dfi ~ ~ ~dz 

These are Newton's equations of motion for a particle using cylindrical coordinates. 

Problems 

1. Find the components of the acceleration vector in spherical coordinates and in 
cylindrical coordinates for a particle. 

2. A particle slides in a fnctionless tube which rotates in a horizontal plane with 
constant angular speed 03. Neglecting gravity, find the reaction between the tube 
and the particle. 

3. If T = a a p(q l , q z , . . . , q n )q a q & , show that 2T = q". 

4. Set up Hamilton's equations for a particle moving in one dimension under a 
Hooke's law force, F k*x. 

5. Integrate (3.91) under the assumption 6 ISO . 

3.16. Euler's Equations of Motion. We discuss the motion of a rigid 
body with one fixed point. The reader is urged to read Example 
2.10 the results of which we shall use. The x coordinate system 
will be fixed in space, and the x coordinate system will be rigidly attached 
to the moving body. We shall use the S sign to represent an integration 
over the complete rigid body. 

In vector notation the angular-momentum vector is defined to be 

H = Sr X mv 
and in tensor notation 

H k = 2me t j k x' t x 3 (3.95) 

From (3.95) 

dHk 
dt 

^\ 

(3.96) 

aiyiPA c i_ / f** / y*^ t _'f*J / ^* ~~ & / yJ'ir* i_'y*T > 7 ~* il 
Blill'v/ ciiTc*" * / *it* x *' *t/b*' * / *ti**' ^ " 



= V me ^xw + V m< k *x 
2^m^, k xx 2_ / m( "" xx 

- V i-, 

- ^me, ik xx' 
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The moment or torque vector is defined as 

d 2 r 



r X f = -^ 

** (3.97) 



Hence T = L k (3.98) 

at 

Now /:/* 

since # = co/a >z [see (2.21)]. Since e ijfc and o>f(0 are independent of the 
space coordinates, we write 



H k = e^fc 
We define /*' = Sra#^ as the inertia tensor. Since x l = A\x a ^ we have 



_ 
dx d& 

The components of the inertia tensor relative to the moving frame 
are constants since the frame is rigidly attached to the rigid body. 
This is not the case for the I d since the o are, in general, functions of 
time. Remember that the a^ are the direction cosines, which vary with 
time. Thus it will be useful to deal with the x coordinate system. In 
this frame 

H k = e^wf (3.99) 

The # transform like the components of an absolute tensor for orthogonal 



transformations since 



= \a]\ = 1 [see (1.25)]. Moreover 

- _ dX a jj _ Aa rr 

dHk _ Aa dH , dA% jj 
dt dt dt 



da* 
From A?a l a = 6}, d} = - A* - it follows that 

Ctt 

^ = A? AKZ 

x7 Ja 

so that 
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Hence [using (3.99)] 



~ 



L k 



(3.100) 



Equation (3.100) is one form of Euler's equations of motion. Since 
I %1 is a quadratic form, we can find an orthogonal transformation which 
diagonalizes this tensor or matrix (see Sec. 1.5). Let us now use this 
new coordinate system (it is also fixed in the moving body). We omit 
the bars of (3.100) for convenience. We have 

7i 
7 2 

00/3 

It is easy to see that I\ + 7 2 = A z , the moment of inertia about the 
z axis, etc. The xyz coordinate system is fixed in the moving body. 
Equation (3.100) becomes, for k = 1, 



(/2 



or 



Similarly 



= Li 



/ll) _ (/3 



UW Z I IY/22 

. dux , 

. du,, , A 4 . j 

A v~"Ti + ( A X ~ A z )u z u x = Ly 

j.. 

= L, 



(3.101) 



1. For a free body, L 



and 



Problems 
0, show that 

+ Aw^ 

+ 



l -f 
l&l + 



= constant 
= constant 



follow from (3.101). 

2. Assume that, at t = for a free body, co x = w , w w = co = 0. Describe the 
motion for 2 > 0. 

3. Solve for o^, co y , o>* for the free body with A x = A v . 

4. Show that the body of Prob. 3 precesses with constant angular speed about the 
angular-momentum vector. 

5. Derive the second and third equations of (3.101). 

3.17. The Navier-Stokes Equations of Motion for a Viscous Fluid. 

Let us first consider the motion of a fluid in the neighborhood of a point 
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P(x) of the fluid. Let the velocity of the fluid at P be given by u\ or 
u l = g va u a . The velocity at a nearby point Q(x + dx) is, except for 
infinitesimals of higher order, 

u l (x + dx) = u l (x) + du t 

- *(*> + *** 



1 /du, _ du a \ , 1 / da d,,.\ 

?i V i '"./"*' i^ <i I X I" i ' / 

2\a.r r).r'/ 2\d.r &r i / 



r/.r 

The partial derivatives arc* evaluated at the point P. Strictly speaking, 
du, is not a vector, so that we should be concerned with the intrinsic 
differential du t . The above can be written 

? y ( r _I_ fj r \ ? y ( r \ J_ 1 ( 1 i _ >ii } f] T a _L l/ ?y -4- ?/ ") f?T a (3 1 0'2 ) 

(I>^^1/ |^ \bJU ) ~~ f>7^.t / |^ ^\''*"i.,a "'cr,i/ i*wl/ j^ ^\' ; *'t,ff 1^ ^OL,%) WJi' VO. i\J4) 

We now analyze (3.102), which states that the velocity of Q is the sum of 
three terms: 

1. u t (x), the velocity of P. Q is carried along with P, so that u t (x) 
is a translational effect. 

2. The term 2-(?i,, a u a>l ) dx" corresponds to a rotation with angular 
velocity <o = ?V X u. A sphere in the neighborhood of P with center 
at P would be translated and rotated under the action of terms 1 and 2. 
It follows that terms 1 and 2 are rigid-body motions. 

3. Hence the term ^(u lta + u a>l ) dx a must be responsible if any deforma- 
tions of the fluid take place. 

We define 

*tf = i(ttw + *M) (3.103) 

as the strain tensor. Let dx 1 = x i temporarily, that is, let P be the 
origin of our coordinate system, x 1 the coordinates of the nearby point Q. 
We can write s tj x l = ^V(s t; x x 7 ) using Euclidean coordinates. However, 
s t jX l x J is a quadratic form~which can be reduced to 

9 = Sn^ 1 ) 2 + M^ 2 ) 2 + 5 M (* 8 ) 2 

under an orthogonal coordinate transformation. In the system, 
Vp = Su&i + s 22 2 j + Ss3^ 8 k. Thus, along with the rigid-body motions 
of terms 1 and 2, occurs a velocity whose components in the 5 1 , 5 2 , x* 
directions are proportional to 5 1 , 5 2 , 5 8 , respectively. The sphere sur- 
rounding P tends to grow into an ellipsoid whose principal axes are the 
35 1 , 2 , 8 axes. 

Terms 1, 2, and 3 characterize the motion of a fluid completely. In 
order to discuss the dynamics of a fluid, one must consider the stress 
tensor iff. We refer to Fig. 2.22. The face ABCD is in contact with a 
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part of the fluid. This part exerts a force on the face. This force per 
unit area has components *", t w , t* v . The y refers to the fact that the 
normal to the face ABCD points in the y direction. By considering 
the other two principal faces one is led to the stress tensor P',i,j = 1,2,3. 
The total force on a closed surface S is given by 

(3.104) 



where N 3 is the unit normal vector to the surface area dd. Let the reader 
show that (3.104) can be written as 

t$dr (3.105) 



so that t\\ represents the force per unit volume due to the stress tensor 
if 9 . The equality of (3.104) and (3.105) is essentially the divergence 
theorem of vector analysis. 

In order to obtain the Navier-Stokes equations of motion, we make the 
fundamental assumption that the components of the stress tensor be 
proportional to the components of the strain tensor. Thus 

t] = ajg2 (3.106) 

We further assume that a*g is an isotropic tensor so that 

(see Example 3.2). 
Combining (3.106) and (3.107) yields 

= k(x)b]s + l(x)s] (3.108) 

and t\ = Zks\ + ls\ = (3fc + I)l 

In hydrostatics 

-pOO 
-p 
-p 

so that t\ = 3p. The pressure p is defined to be p = $t\ for the 
general case. Thus 

I . 
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Equation (3.108) becomes 



and tj ti = pj ~ s% tj + ls} ti 

or j if = -pj - g 0-^db,, + J0*8 jfcf , (3.109) 

From Sj ktt = iw,, fcl + JM*,* and I = 2v, (3.109) becomes 

Fj = t< = p, 3 + Tsvg ak u a ,k} 4- vg kl u jt ki (3.110) 

In vector form 

F = -Vp + ~V(V-u) + i>V 2 u (3.111) 

o 

where F is the force per unit volume due to the stress tensor t]. Let f 
be the external force per unit mass so that (F + pf) dr is the total force 
acting on the element dr. Newton's second law of motion states that 

j t (p dr u) = (F + pf ) dr 

or p dr -TJ = (F + pf ) dr 

since p dr is a constant during the motion. Hence 

p *J = p f _ vp + ^ V(V - u) + V' 2 u (3.1 12) 

(/r o 

where v is the viscosity of the fluid. Equation (3.112) is the Navier- 
Stokes equation of motion for a viscous fluid. 

Problems 

1. For an incompressible fluid show that 

du _ . 

2. Consider the steady state of flow of an incompressible fluid through a cylindrical 
tube of radius a. Let u = w(r)k, r 2 = z* -f 2/ 2 . Show that p = p(z), vV 2 w = ~ 

Then show that u j- (r 2 a 2 ), ~- A = constant. 

8. Solve for the steady-state motion of an incompressible viscous fluid between two 
parallel plates (infinite in extent), one of the plates being fixed, the other moving at a 
constant velocity, the distance between the two plates remaining constant. 

4. Find the steady-state motion of an incompressible viscous fluid surrounding a 
sphere which is rotating about a diameter with constant angular velocity. No 
external forces exist. 
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CHAPTER 4 

COMPLEX-VARIABLE THEORY 



4.1. Introduction. The reader is already familiar with some aspects 
of complex numbers. We enter now into a discussion of some of the 
simpler properties of complex numbers. In order to attach a solution 
to the equation x* + 1 = 0, the mathematician^ forced to invent a new 
number, i, such that z 2 + 1 = 0, or i \/ 1. We say that i is an 
imaginary number in order to distinguish it from elements of the real- 
number field (see Chap. 10 for a discussion of this field). The solution 
of the quadratic equation ax 2 + bx + c = 0, a 9* 0, requires a discussion 
of complex numbers of the form a + pi, a and ft real. The set of all such 
complex numbers subject to certain rules and operations listed below is 
called the complex-number field, an extension of the real-number field. We 
note that the complex numbers are to satisfy the following set of rules or 
postulates with respect to the operations of addition and multiplication : 

1. Addition is closed, that is, the sum of two complex numbers is a 
complex number. 

(a + bi) + (c + di) = (a + c) + (6 + d)i 

2. Addition obeys the associative law. If z\ = a\ + bit, 2 2 = a 2 + bzi, 
2s = 3 + b s i, then 

21 + (22 + 2 8 ) (zi + z*) + z* 

3. The unique zero element exists for addition. 

= + 0-2 and 2 + = + 2=2 for any complex number z 

4. Every complex number z has a unique negative, written z, such 
that z + (-z) = (-z) +2 = 0. If 2 = a + fo, then 

-2 = (-a) + (-fr)i = -a - bi 

5. Addition is commutative. 



6. Multiplication is defined as follows: If z\ = a\ + bii, z* = a* + bji, 
then 

ziz* = (aia 2 fci6 2 ) + (aib 2 + a 2 bi)i 
122 
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The product of two complex numbers is again a complex number. This 
is the closure property for multiplication. 

7. Multiplication obeys the associative law. 

1(2223) = (2i2 2 )2 3 = 2i2a23 

8. The unique element 1 = 1 + i exists for multiplication. 

1 z = 2 1 = z for all 2 

9. Every nonzero complex number z has a unique inverse, written 
z~ l or l/z, such that zz~ l = z~ l z = 1. If z = a + bi, a 2 + b- 7* 0, then 



a 2 + 6 2 a 2 + b 2 
10. Multiplication is commutative. 



11. The distributive law holds with respect to addition. 

2i(2 2 + 2 3 ) = 2i2 2 + 2i2 3 (Z\ + 2 2 )2; { = 2i2 3 + 



4.2. The Argand Plane. The complex number z = x + iy admits of a 
very simple geometric representation. We may consider z as a vector 
whose origin is the origin of the Euclidean xy plane of analytic geometry 
and whose terminus is the point P with abscissa x and ordinate y. The 
mapping of complex numbers in this manner yields the Argand z plane. 
Addition of complex numbers obeys the parallelogram law of addition of 
vectors. We call x the real part of z, written x = III 2, and we call y 
the imaginary component of 2, written 
y = Im 2 (see Fig. 4.1). 

The length of the vector represent- 

ing z = x + iy is called the modulus ^/^ \ y =Imz 

of 2, written mod z = \z\ = (x 2 + y 2 )*. 
The argument of the complex number _ 
2 is the angle between the real x axis 
and the vector 2, measured in the 
counterclockwise sense. The argu- I0 ' ' 

ment of z is not single-valued, for if 6 = arg 2, then 9 2rn is also the 
argument of z for any integer n. We define the principal value of arg z 
by the inequality TT < Arg z g ?r. 

Example 4.1. If z 1 -f i, then \z\ -y/2, and Arg 2 r/4. If z 1, then 
|| - 1, Arg 2 - T. If 2 - -2, then |*| 1, Arg z -*-/2. 
Example 4.2. If we use polar coordinates, we may write 

2 z + iy * ''(cos + i sin 0) 
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\z\ r, arg z 0. If z\ ri(cos 0i + i sin 00, 2 2 r 2 (cos 2 + i sin 2 ), we leave it 
to the reader to show that 

ZiZt = rir 2 [cos (9\ + #2) + sin (0i + 2 )] 

- = [cos (0i - 2 ) + t sin (0i - 2 )] r 2 7* 
22 r 2 

x - 

We leave it to the reader to attach a simple geometric interpretation to multiplication 

and division of complex numbers. 

Example 4.3. The reader should verify that 

Rl (Zi + 2 2 ) - Rl 2i + Rl 2 2 

Im (21 + 2 2 ) = Im zi -f Im z 2 

\Z\Z*\ = |2i| |2 2 | 

arg ziZz = arg 21 -f arg z z 2irn n an integer 
arg r2 = arg 2l _ a rg z 2 2irn 

-W ^Rl2 |2| 
-|2| ^ Im 2 ^ |2| 

If 2 = x + ^2/> we define the complex conjugate of z by the equation 
z = x iy. The conjugate of a complex number is obtained by replac- 
ing i by i. Obviously 



z + z = 2 Rl z 
z z = 2i Im z 

zz = x* + y* = |z| 2 

From |si + 2 2 | 2 = (^i + 22) (21 + 22) we let the reader deduce that 
\z\ + zz\ ^ |^i | + \Zz\y and from this that \zi z z \ ^ | \Zi\ \z*\ \ for all 

Zl, Z 2 . 

4.3. Simple Mappings. Henceforth the complex number z will stand 
always for the complex number x + iy> We now examine the complex 
number w = z 2 = (x + iy) z = x 2 y* + 2xyi. It will be highly bene- 
ficial to construct a new Argand plane, called the w plane, with 

w = u + iv 

u and v Euclidean coordinates. The equation w = z 2 may be looked 
upon as a mapping of the complex numbers of the z plane into complex 
numbers of the w plane. The transformation w = z 2 maps the point P 
with coordinates (x, y) of the z plane into the point Q of the w plane whose 
coordinates are u = x 2 i/ 2 , v = 2xy. A curve in the xy plane will, in 
general, map into a curve in the uv plane. For example, the transforma- 
tion w = z 2 maps the straight line x = t, y = mi, <x> < t < oo , into 
the straight-line segment u = (1 m 2 ), v = 2mt, g t < . 

The hyperbolas x 2 y 2 = constant = c map into the straight lines 
u c. The hyperbolas xy ** c map into the straight lines v = 2c. 
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Example 4.4. We now examine the transformation w = z + 1/z, z ^ 0. We have 

z =s r(cos -f- i sin 0), w = u -\- iv, so that 

u -\- iv = r(cos + i sin 0) -f- - (cos 6 i sin 0) 
and ?/ = (r -f - J cos 

?' = ( r -- ) sin 

From cos 2 + sin 2 = 1 we have 

2 

f ^ 



__ _ __ 

(r + f/r) (r - I//) 

The circles r = a 7* 1 map into the ellipses 



r/2 ,,2 



(a + I/a) 2 Ma - I/a)' 
of the uv plane. Into what curve does the circle r 1 map? 

Problems 

1. If a + 7>? = c -f di, a, 6, r, </ real, show that a = c, 6 d. 

2. Show that 2 = for all z by use of the distributive law and the definition of 
the zero element. 

3. From |z, -f- z 2 2 = (zi + 2 2 )(2i + 22) show that |zi -f 22! ^ |i| -f- |22|. 

4. Show that (cos 6 -\- i sin B) n = cos (n6) + i sin (n0), n an integer. This is a 
formula of De Moivrc. Obtain an identity involving cos 40. Solve for the roots of 
2 s = i f Z 4 = 1? 2 6 i, 

6. Examine the transformation w = z 1/z in the manner of Example 4.4. 

6. Examine the transformations w az, a complex, w = z -f h, b complex, w = 1/z. 
Examine the important bilinear transformation w (az -f- b)/(cz + rf), a, 6, c, d 
complex. 

4.4. Definition of a Complex Function. Let Z = \z} be a set of com- 
plex numbers defined in some manner. Now assume that by some rule 
or set of rules we can set up a correspondence such that for every point z 
of Z there corresponds a unique complex number w,j^nd let W = \w\ 
be the totality of complex numbers obtained in this manner by exhausting 
the set Z. We thus have a mapping Z > W which defines a complex 
function of z over the set Z. The correspondence between the element z 
and the element w is usually written 

w=f(z) (4.1) 

It is customary to consider f(z) as the complex function of z. We can 
write f(z) = u(x, y) + iv(x, y), since, given z, we are given x and t/, 
which in turn yield the real and imaginary parts of w, called u(x, y) and 
v(x, y), respectively. 
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Example 4.5. Let Z be the set of complex numbers z such that \z\ > 1. The 
correspondence z > w such that w ** z/(\z\ 1) defines a complex function of z for the 
domain of definition of z. In this case 

/ \ x / \ y 

U(X. U) = V(X t V) = , .:::.... 

y Vx^TV 2 - 1 vV + y 2 - i 

with x 2 + ?y 2 > 1 

Example 4.6. Let Z be the finite set of complex numbers z = 1 and 2 = 2. Let 
2 1 correspond to ir =* 5, 2 2 correspond to ?r = 1 + i. This mapping defines 
a complex function of z defined over the set Z. 

Example 4.7. Lot Z be the set of real integers, and let z of Z correspond to the 
constant w = ? for all z of Z. Note that m this case more than one element of Z 
corresponds to the same value w. A complex function can be a many-to-one mapping. 
Remember, however, that only one w corresponds to each z. This is what we mean 
by a single-valued function of z. 

4.6. Continuous Functions. We define continuity of a single-valued 
complex function in the following manner : Let the points z of Z be mapped 
into the points w of W, and consider the two Argand planes, the z plane 
and the w plane. We say that /(z) is continuous at z = z , z in Z, if the 
following holds: Consider any circle C of nonzero radius with center at 
WQ = /(zo). We must be able to determine a circle C' of nonzero radius 
with center at zo such that every point z of Z interior to C" maps into a 
point w of W interior to C. Analytically this means that, given any 
> (the radius of the circle C), there exists a 6 > (the radius of the 
circle CO such that 

I/CO -/(Z )| < 

for all z of Z such that |z z | < 5. This is the usual , 5 definition of 
continuity of real-variable theory. If /(z) is continuous at every point 
z of Z, we say that /(z) is continuous over Z. 

If Zij i = 1, 2, 3, . . . , is an infinite sequence of Z tending to zo as a 
limit, Zo in Z, then the above definition of continuity implies that 

lim /(z) = /(z ) 

e *o 

Let the reader verify this. 

Example 4.8. Let f(z) z for all z. We easily note that this function is continuous 
for all z since we need only pick 6 e for every choice of e > 0. 

Example 4.9. Let/(z) and g(z) be defined over the same set Z [z\, and assume 
that/(2) and g(z) are continuous at the point z in Z. We now show that/(z) 4- g(z) 
is continuous at z . Choose any e > 0, and consider e/2 > 0. Since f(z) is con- 
tinuous at ZQ there exists a 61 > such that \f(z) f(z )\ < e/2 for \z z \ < 61, z in 
Z. A similar statement holds for g(z), with 5 2 replacing 61. The reader should be 
able to show that \(f(z) + g(z)) - (/( ) + 0(o))| < c for \z - 2 | < a, z in Z, the 
smaller of 5i, 6j. 
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Example 4.10. Let the reader prove the following statements: Let/() and g(z) be 
denned over Z, and assume /(z) and g(z) continuous at ZQ in Z. Then 

! /(z) ~ 0( z ) is continuous at ZQ. 

2. f(z)g(z) is continuous at z . 

3. f(z)/g(z) is continuous at z provided g(z Q ) ^ 0. 

Problems 

1. Prove the statements of Example 4.10. 

2. Show that/(z) * aoz w + aiz n-1 -f + o n , n a positive integer, is continuoun 
for all z, z ^ oo . 

3. Show that the /(z) of Examples 4.6 and 4.7 are continuous over their domain of 
definition. 

4. Let /(z) - ( 8 - l)/(z - 1), z 5* 1, /(I) 3. Show that /(z) is continuous at 
z - 1. 

5. Let /(z) be continuous over a closed and bounded set. Show that /(z) is uni- 
formly continuous (see Sec. 10.12 for the definition of uniform continuity). 

6. Let/(z) = 1/(1 z), z 7* !,/(!) 5. Show that /(z) is discontinuous at z 1. 

7. Show that /(z) x sin (1/x) -f ly, x 7* 0, /(O) = 0, is continuous at the origin 
(0, 0). 

4.6. Differentiability. Let /(z) be defined for the set Z = {z}. Let 
z be a point of Z, and let Zi, Z2, 3, . . . ,z n , . . . be any infinite sequence 
of elements of Z which tend to z as a limit. None of the z,, i = ], 2, 3, 
. . . , is to be equal to z . We say that /(z) is differentiate at z if 

Jin, /fa) - /^ (4.2) 

exists independent of the sequential approach to z . We can state 
differentiability in an equivalent manner, /(z) is differentiate at ZQ 
if there exists a constant, written/' (20), so that for any > there exists 
a 6 > such that 

/CO - /(go) 



z - zo - < 

whenever |z ZQ| < 5 for z in Z, z ?^ ZQ. 

In the cases we shall be interested in, ZQ will be an interior point of Z 
(see Sec. 10.7) for the definition of an interior point. In this case, (4.2) 
becomes 



/(2o) = lim - (4.3) 



independent of the approach to zero of Az. Let us consider 

f(z) = w(x, y) + iv(x, y) 
and investigate the conditions that will be imposed on u(x, y) and v(x, y) 
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in order that 



shall exist independent of the approach to zero of Ag. We have 

Ag = Ax + i Ay 

f(z + Ag) = u(x + Ax, y + Ay) + iv(x + Ax, y + Ay) 
/(g + Ag) /(g) ?/(x + Ax, y + Ay) - w(x, y) 



so that 

Ag ax -r- z ay 

. v(x + Ax, y + Ay) - v(x, y) (4 .. 
^ Ax + t Ay ^ ' ' 

First we let Ag > by keeping y constant, that is, Ag = Ax, Ay = 0. 
Then (4.4) becomes 

/(g + Ag) - /(g) _ u(x + Ax, y) - u(x, y) v /<x + Ax, y) - y(x, y) 



and hm 



Ax Ax 

/JiMlI/<E> *- 



provided and exist. Now we let Az - by keeping x constant, 
ox c/x 

that is, Ax = 0, kz = i Ay. Then 

/(* + A^) - f(z) _ldu dv 

11 111 ----- m ..... -- -p ~7r~ 

Az i dy dy 

dv . du 



provided and T- exist. Assuming differentiability of /(g) yields 
ay ay 

dtt . dy a/ ( . du 
-- 1- ^ = _ ? 

ax ax dy dy 

^i_ , & u & v /4 -N 

so that = (4.o) 

dx dy 

dv ___ du 
dx dy 

The Cauchy-Riemann conditions (4.5) are necessary if /(g) is to be 
differentiable. We have, however, neglected the infinity of other meth- 
ods by which Ag may approach zero. But it will turn out that the 
Cauchy-Riemann equations (4.5) will be sufficient for differentiability 
provided we further assume that the partial derivatives of (4.5) are 
continuous. Let us proceed to prove this statement. The right-hand 
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side of (4.4) may be written 

u(x + Ar, y + A,y) - ?/Q, y + A//) + u(x, y + Ay) - u(x, y) 

Ax + i A// 
-L. - *'( + As, y + A?y) - r(j, y + A//) + r(x, ?/ + Ay) - v(x, y) 

" " ' 



- 

" " Ax + i 'Ay 

s Ax (0tt/a.r + tO + Ay (du/dy + fa) 
AJ + ? A?/ 

, . Ax (frj/to + fa) + Ay (dv/dij + fa) 
"*" l Ax + i Ay 

where fa, fa, fa, fa tend to zero as Az > 0. We have applied the theorem 
of the mean of the differential calculus. The t > if we assume con- 
tinuity of the partial derivatives. Making use of the Cauchy-Riemann 
equations yields 

f(z + Az) - f(z) = du . dv fa Ax + fa Ay + f (fa Ax + fa Ay) 
A (9.r <3.r Ax + i Ay 

We leave it to the reader to show that 

H,n ^ Ax + ?a A// + 7(6 Ax + fa Ay) 

11 in ---------- - - ; 7. - u 

Ax + i Ay 



TT r f( z + Az) /(z) . , , . du . . dv . , , x , 

Hence lim - --^ exists and equals - \- i -^- independent of 
Az dx dx ^ 



the manner of approach to zero of Az. We have proved Theorem 4.1. 

THEOREM 4.1. f(z) = u + iv is differentiate if u and v satisfy the 
Cauchy-Riemann equations and if, furthermore, the four first partial 
derivatives of u and /' with respect to x and y are continuous. 

The following example shows that we cannot, in general, discard the 
continuity of the partial derivatives. Let 



Here u(x, y) = ^ % w(0, 0) = 

(*, 2/) = [-^77! KO, 0) = 



Moreover 
dx 



0,0 



* - 



Similarly = -1, ^ = 1, T = 1 > at 2 = 0, so that the Cauchy- 
Riemann equations hold at z = 0. However, /'(O) depends on the 
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approach to the origin, for let y = mx, so that 

f/ v x(\ + i) xm 3 (l i) 

f(z} = - nr^ - 

along y = rax, and 



,./(*) -/(O) = rm (1 + i) - m(l - t) x = (1 + f) - m 3 (l - t) 
^2-0 (1 +ro 2 )(l+roi) x (1 + m)(l + mi) 

which depends on m. Let the reader show that is discontinuous at 

dx 

the origin by showing that lim does not exist. 

- 



.Q 





OX 

- 

DEFINITION 4.1. Let/Os) be differentiable at z = 20. If, furthermore, 
there exists a circle with center at ZQ such that /() is differentiable at 
every interior point of that circle, we say that f(z) is regular, or analytic, 
at ZQ. Analyticity at a point is stronger than mere differentiability at 
that point (see Prob. 10 of this section). If /(z) is not analytic at 20, we 
say that ZQ is a singular point of f(z). 

Example 4.11. We show that /(z) z* is differentiable everywhere. Since 
f(z) (x 4- t'/y) 2 = x 2 v 2 -f 2tf?/i, we have ?/ =* x 2 # 2 , t> = 2xy. Hence 



// _ _ _ r u _ rt _ _ 
~ X ~ j ~ ~~ y 



so that the Cauchy-Riemann equations hold. Moreover it is obvious that the partial 
derivatives are continuous. Thus f'(z) - \- i ! = 2x -f 2yi 2(x + ^2/) " 2z. 

c/X c/3J 

We could have shown that f'(z) exists and equals 2z by applying (4.3). 



Let the reader show that if /(z) z n , n an integer, then/ ; (z) nz!*~ l . 
Example 4.12. The reader should verify that if /' (z<>), g'(zo) exist, then 

(1) ~ \f(z) + g(. 

(2) ~ [/(s)(/(r)]i 



We assume that /(z) and 0(2) have the same domain of definition. 

Problems 

1. If u and v satisfy (4.5) and if their second partials exist, show that V*u V*t; 0. 
Also show that 

du du . dv dv ~ 
dx dy dx dy * 
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Give a geometric interpretation of this last equation. Remember that 

_ du . . du . 
Vu - 1 + j- J 

is a vector normal to the curve u = constant. 

The fact that u and v must satisfy Laplace's equation proves to be useful in the 
application of complex-variable theory to electricity and hydrodynamics. 

2. Let f(z) u + iv be differentiate, and assume u is given such that V*u 0. 

~. . , x f x du(x. y) , , fy du(xQ. y) , T , 

Show that v(x, y) = / r dx + / . ay. Let 

Jxo dy Jyo dx 

f(z) x* 3xy* -f iv(x, y) 

be differentiate. Find v(x, y). Note that V 2 (x a - Sxy 2 ) = 0. 

3. Show that f(z) = x iy is not differentiate. 

4. Show that/(z) - x* 3xy z -f t(3x*y j/*) is analytic everywhere. Then show 
that/(2) - z*. lif(x + iy) - u(x, .v) + *v(x, y), show that/(z) ?^(z, 0) -f it;(2, 0). 

5. Let sin z = sin x cosh ?/ -f- ^ cos x sinh y. Show that sin 2 is analytic every- 
where. Is this definition of sin z consistent for z real? Define cos z s ^ , and 



dz 



show that sin 2 z -f cos 2 2 = 1. 

6. If f'(z) for all z, show that/() constant. 

7. Prove the statements in Example 4.12. 



8. If/00 - a n z n -f an-ix"- 1 -f - - -f a - V a*^, show that f (2) - Y 



Ar-O 



9. If /() 



a2 tt converges for ^ |z| < R, show that/' (2) 



na n 2 n ~ 1 for 



g\z\ <R 

10. Show that /(2) = x 2 // 2 is differentiable only at the origin. 

11. Assume /'(a) and g'(a) exist, g'(a) ^ 0. If /(a) gr(a) 0, show that 



y 



z-+a g(z) g'(a,y 

4.7. The Definite Integral. The reader is urged to read first those 
sections of Chap. 10 concerning the uniform continuity of a continuous 
function, rectifiable Jordan curves, 
the Riemann integral, and Cauchy 
sequences. Let F be a rectifiable 
Jordan curve 1 joining the points 
z = a, z = ft. Let f(z) be a continu- 
ous complex function defined on F. 
We are not concerned with the defi- 
nition of f(z) elsewhere. Neither do 
we introduce the differentiability of 
f(z). The Riemann integral of f(z) 
over F is defined as follows: Subdivide F into n parts in any manner 
whatsoever. Call the points of the subdivision a = zo, z\, z%, . . . , Zk, 

1 Called a simple curve. 



o 




FKJ. 4.2 
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. . , 2 n = (see Fig. 4.2). On each of the paths zoZi, ZiZ 2 , . . . , 
I, i 2n-i n choose a point 1, { 2 , . . . , *, &+i, . . . , n, and 
form the partial sum 



If lim J n = J exists and is unique independent of the choice of the 

n > 

& and independent of the method of subdividing F provided only that, 
as n tends to infinity, the maximum of the arc lengths from z*_i to z*, 
k = 1 , 2, . . . ,n, tends to zero, we say that f(z) is Riemann-integrable 
over F and write 

j = P f( 2 ) dz over T 

Jot * 

= P /(*)& = if(z)dz (4.7) 

y(D yr 

This definition agrees with the definition of the Riemann integral of real- 
variable theory if F is a section of the real axis, F: a ^ x ^ 0, and if 
f(z) is a real function of x. 

We now show that, if f(z) is continuous on the rectifiable Jordan arc 
given by x = /(/), ?/ = ^(0> ^o ^ ^ ^ <i, then the Riemann integral of /(z) 
over F exists. The proof proceeds as follows: Let us first look at any 
arc of F joining z = a to z b and consider 



where is any point on the arc joining z = a and z = b. A further sub- 
division of this arc into ZQ = a, zi, Z2, . . . , z n = 6 yields the partial sum 
defined as in (4.6), 

n 

- 2*-,) 



Now 5 = /({)(& - a) = /(() V (^ _ ^,) = 

k=l k - 1 

w 

so that S - S n = (/({) - /(&))(* - *-i) 



If furthermore the maximum variation of /(z) on the arc joining z = a, 
z = 6 is o-, then 

n n 

|S - 8 n \ y \f(& - /(&)| |* - ft-il ^ <r ^ k* - -i| ^ <rL (4.8) 
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where L is the length of arc from z = a to z = 6. Why is 



This result will he important in what follows. Now we use the property 
that f(z) is uniformly continuous on F. Choose a subdivision of F so 
that the maximum variation of /(z) on any segmental arc of the sub- 
division is less than , and obtain J \ [see (4.6)]. Now impose a finer sub- 
division on the previous subdivision such that the maximum variation 
of f(z) on any segmental arc of the new subdivision is less than 1/2 2 , and 
form a J 2 for this subdivision. Continue this process. For </ the 
subdivisions are so fine that the maximum variation of /(z) on any seg- 
mental arc is less than l/2 n . Moreover the maximum subdivision tends 
to zero in size. We obtain the sequence of complex numbers 

,/!, J 2 , J 8 , . . . , ,/, . . . (4.9) 

We now show that lim J n exists. Choose any e > 0. We can find an 

n * 

integer no such that l/2 n < e/L for n ^ no, where L is the length of arc 
of F. Now for m ^ n ^ HQ we have 

\J m - Jn\ <^L < 

L 

using the result of (4.8). Hence the sequence (4.9) is a Cauchy sequence 
and 

lim J n = J 

n 

We must now show that J is the same limit for any other choice of the 
&. For the new choice of the & we obtain the sequence 

Tf Jf If 

J j, J 2 , . . . , t/ n , . . . 

and by exactly the same reasoning as above we have 

lim J' n = J' 

n>*> 

But \J n J'n\ < (V2 n )L, and we leave it to the reader to show that this 
implies J = J'. 

The final step is to show that the same limit J occurs for any other 
method of subdividing provided the maximum length of the subdivisions 
tends to zero. For any other sequence K\, K*, . . . , K n , ... of partial 
sums of the form (4.6) we can superimpose the subdivisions which yield 
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K n on the subdivisions of J n of (4.9). Then 



from (4.8). We leave it to the reader to show that lim K n = lim J n . 

n n * 

This concludes the proof. The reader is referred to that excellent text 
by Knopp, "Theory of Functions," Dover Publications, 1945, for a much 
clearer expository of the Riemann integral. 

For the reader who has trouble in understanding what has been 
attempted let us note that if /(z) = u(x, y) + iv(x, y) and if T is given by 
x = x(i), y = y(t), to ^ t g fa, then dz = dx + i dy, 

f(z) dz = u(x, y) dx - v(x, y) dy + i[u(x, y) dy + v(x, y) dx] 
so that it would be logical to define 

f r f(z) dz = j r u(x, y) dx - v(x, y) dy 

+ i [| r u(x, y) dy + v(x, y) dx] (4.10) 

where I u(x, y) dx, etc., are the ordinary line integrals of real- variable 

theory. It is not very difficult to show that this definition and existence 
of the line integral agrees with that discussed above for a continuous 
f(z) and hence a continuous u(x, y), v(x, y). In particular if the Jordan 

curve is regular in the sense that -rr and -~ are continuous, then (4.10) 
becomes 

dz = ' (*(<), y(t)) ^ - v(x(t), y(0) ^ j dt 

(x(0, y(t)) - + v(x(f), y(t)) dt (4.11) 



Example 4.13. We evaluate / f(z) dz, where /(z) = z and the curve F is given as 

the straight line joining the origin and the point (3, 4). F may be written x 3, 
y - 4J, t ^ 1. Along F 

/() - 2 - x -f ty - (3 4- 4i)( 

Moreover the vector z to the curve is z (3 -f 4i)i so that dz (3 -f- 4t) rf< and 
/(*) dt - (3 + 4)"< dt. 

r i 
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We could save ourselves all this work if we knew that 

rp 



f 



zdz 



for any simple path F from z a to z = /3. Let the reader show that this is true. 
Hint: subdivide F into a = z , z\, z, . . . , z w = 0, and consider 



n 

) Zt_.(Zt - Zi-l 



Show that J w 4- J' n = /3 2 a 2 , and let n > o . 

In this example we notice that / z dz depends only on the end points of V and not 
on r itself. The significance of this will become apparent in the next section. 

Example 4.14. We evaluate / f(z) dz for /(z) = x iy with F the straight line 
from (0, 0) to (1, 1). Here x = t, y t, ^ t 1, 2 - * + fl, dz - (1 + t) ctt so that 

J r /(z) dz = JJ (1 - ?)(! + O/ ^ = 1 

For the same/(z) let us take F as the sum of the two straight-line segments, one from 
(0, 0) to (1, 0), the second from (1, 0) to (1, 1). For the first path x = t, y * 0, ^ t 

^ l,sothatdz = dt and/(z) = t and / f(z) dz \. Along the second path a: 1, 

yi'i 
dx = 0, y - , d =* i dt, g t ^ 1, and /(z) - 1 - i< so that 



I dz - 

JTs 

Hence 

r r 

i /GO - 

Thus, for /(z) = x iy, the line integral is not independent of the path. One may 
guess that the nonanalyticity of /(z) x ly may be the answer. The next section 
will verify this fact. 

Example 4.15. The function /(z) ** 1/z is continuous and analytic everywhere 
except at the origin. Let us compute ff(z) dz, where F is a circle of radius a with 
center at the origin. It is easy to see that z = a (cos 4- i sin 0), ^ :S 2ir, repre- 
sents the circle. Hence dz a( sin + i cos 0} d0, and 



f 
7 



- sin 9+ i cos = ( _ 8in(> + I - c08( , )(cosfl _i 8ine)(W 



cos 8 + i sin 8 

'2r 




Even though /(z) is analytic on F, //(z) dz 5^ 0, in this case. We shall see that the 
singularity of /(z) at the interior point z accounts for this fact. Notice that the 
value of the integral is not dependent on the radius of F. Indeed, it will be shown 
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that for any closed rectifiable path F, with the origin in its interior, one has 



r z 
Problems 



1. Show that f ft z*dz = i(/3 3 - 8 ). 

Ja 



2. Show that f(z - z ) m dz = 



if ra ^ - 1 

27rt if w = - 1 

w a positive or negative integer, if the integration is performed around the circle F 
with radius a and center z . The integration is performed in the counterclockwise 
sense. 



3. Define t |/(z)| \dz\, and show that 

\J r f(z)dz f r \f(z)\\dz\ 

4. Show that ff&dz = - f a f(z)dz. 

J <x J & 

(r) (F) 

6. If \f(z}\ ^ M along F and L is the length of F, show that 



f(z] dz 



LM 



6. Why is it that / cf(z) dz = r / /(z) f/2 for r constant? 

7. Why is it that J r /iGO ^ + f^f*(z) dz = J r [/i( 2 ) + / 2 ( 2 )] d? 

8. Evaluate / (z + ||) dz, where F is the straight line from (0, 0) to (1, 1). Do 
the same for the other path discussed in Example 4.14. 

9. Evaluate / along the path F consisting of FI and F 2 , where FI is the straight 

line from (1, 0) to (|z |, 0) and Fs is the arc of the circle from (|z |, 0) to z with center 
at the origin. Is the value of the integral single-valued? 



y) 



C(x, y) 



4.8. Cauchy's Integral Theorem. The fundamental theorem of com- 
plex-variable theory is due to Cauehy. There are various forms of this 

theorem. We present now a proof 
of one form. 

Let S be a simply connected 
open region such that the partial 
derivatives of u(x, y} and v(x, y) 
are continuous and satisfy the 
Cauchy-Riemann equations, 

f(z) = u(x, y) + iv(x, y) 

at every point of S. In Sec. 4.6 we saw that] these conditions were sufficient 
for f(z) to be analytic in S. We shall show that, if T is any closed simple 







FIG. 


JBfe 

4.3 
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curve inside S, then 

j) T f(z)dz = (4.12) 

The statement concerning (4.12) is Cauchy's integral theorem. We 
prove first that $J(z) dz = for the rectangle C (see Fig. 4.3) contained 
entirely in S. 

We can obtain a single-valued complex function of the two real var- 
iables x and y by defining 

F( x , y) = f x f(t + ij/o) dt + i f y f(x + it) dt 

F(XJ y) is the sum of the integrals of f(z) along the straight lines AB and 
BC. Similarly G(x, y) is obtained by integrating /(z) along AD and DC. 
The integration of /(z) around the rectangle C in the counterclockwise 
sense is 

<f> c f(z) dz = F(x, y) - G(x, y) 
where G(x, y) = i I ' f(x, + it) dt + I* f(t + iy) dt 

Jyo Jxo 

If we can show that F(x, y) = G(x, ?y), then (p /(z) dz = 0. 

J c 

Let the reader show that 



J I/O 



V-^ dt (4.13) 

U* J yo OX 

We need 

4-1 f(* + W) dt = /(x + iy ) 
dx J x 



dx J yo J yo dx 

in order to obtain (4.13). These statements are proved in Chap. 10 for 
/ real. The student need only write / = u + iv and apply the theorems 

of real-variable theory to u and v separately. The continuity of - 

ox 

is used to perform the differentiation underneath the integral. Also 
we have 

dF 

jj = if(x + iy) = if(z) 

dC 1 

= f(x + iy) = /(*) (4.14) 



dG ... . . . . * df(t + iy) 

Ty - ** + *) ^^ 



138 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

As yet, we have not made use of the Cauchy-Riemann equations. Thus 

. dv 



df du dv dv . du __ . f du 

dx dx dx dy dy \dy 

, f v df(x + it) 

' L dt 



dF 



Hence = f(jc + 



dt = f(x + iy) = (4.15) 



Similarly ^ - ^, and hence dF = dG, F = G + c. Since F = G = 

at (#o, 2/o), the constant c is zero. This proves that a) f(z) dz = 0. 
We now consider (ft /() cte, where Fo is any closed rectifiable Jordon 

curve lying inside the rectangle ABCD. Parts of the curve may be 
segments of the sides of the rectangle (see Fig. 4.4). Let P(x, y) be any 
point on F , and define F(x, y) at P in exactly the same manner in which 
F was defined at C(x, y). From (4.14) and (4.15) 

dF dx + dy = f(z) dx + if(z) dy 

= f(z) (dx + i dy} = f(z) dz 
Hence <f> f(z) dz = (f) dF - (f> r/T + i <f> dV = 

jTo * / To J I'o / To 

where F = f / + zT. Certainly /df 7 = fdV = 0. 

The final part of the proof uses the following reasoning: Let F be any 

closed simple curve in S. F is a closed set of points. Now S was assumed 

to be an open set. If we adjoin to S 
its boundary points, we obtain S, a 
closed set, consisting of S and the 
boundary of 8, say, F. There will be 
a minimum distance between F and 
T not equal to zero. We can there- 
fore impose a fine enough mesh on S 

A / v (the mesh consists of rectangles) so 

0> that the rectangles which contain the 

points of F will be entirely in S. 

The integration over all rectangles interior to F plus the integration 

of f(z) over those boundaries which include F vanish from the above 

results. However, all integrations over the rectangles interior to F 

vanish in pairs, leaving 

<j) f(z) dz = (4.16) 

The condition that the partial derivatives of u and v be continuous 
can actually be omitted. The proof that <b f(z) dz = for the rectangle 
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C under the condition that /(z) be analytic in S can be shown as follows : 
First assume (p /(z) dz 7* 0. Subdivide the region R (C and its interior) , 
into four new regions (see Fig. 4.5) by halving the sides of the rectangles. 
Now (p /(z) dz equals the sum of the integrals of /(z) over the boundaries 

of the four new regions since the internal integrations cancel each other in 
pairs. Hence at least one of the four integrals does not vanish. We 

choose that boundary Ci for which (p /(z) dz has the largest value. 

J Ci 

Again we subdivide, choose C 2 , and continue this process indefinitely. We 
obtain a sequence of regions /?, Ri, ff 2 , . . . , /?, . . . , with boundaries 
C, Ci, C 2 , C 3 , . . . ,C W , . . . such that 

(D f(z) dz 7* 0. The regions R n , n = I, 

2, ... are closed and bounded sets, 

and the diameter of R n tends to zero 

as n > oo . From the theorem of nested 

sets (see Chap. 10) there exists a unique 

point P which belongs to every R n , 

n = 1, 2, 3, . . . . Let 7 J be the point z 

or zo lies on C. Hence /(z) is differentiate at z = zo so that 

/(*) = /(*o) +/'(2o)(z - *) + e(z, z fl )(z - z ) 

where t(z, zo) tends to zero as z tends to z . Hence, given any c > 0, we 
can pick a region R n with boundary C n such that |c(z, z )| < eo for all z 
on C n . Now 



-H-- 



Fio 4.5 
Obviously z is interior to /?, 



But 



so that 



Cn 



cfe + f f'(z*)(z - 

./Cn 



dz 



<fc /(z) dz = / e(z , )( - o) 
y c n yCn 



Remember that $ dz = 0, j>z dz = 0. For any positive e we choose n 
sufficiently large so that |c(z, ZQ)| < eo for all z on (7 n . If l n is the length 
of the diagonal of the rectangle C n , then 



/(z) dz 



\z - z | |dz| ^ 



4 



since l n = L/2 W , where L is the length of the diagonal of C. Hence 

4L 2 



< 4 n 



4 L 2 
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Since eo can be chosen arbitrarily small, the constant u) /(z) dz must be 

zero. Q.E.D. The proof of Cauchy's theorem follows then in exactly 
the same manner as demonstrated above. This proof is due to Bliss in 
the American Mathematical Society Colloquium Publications, vol. 16. 

A very strong statement of Cauchy's theorem is as follows: Let F 
be a simple closed path such that /(z) is analytic in the interior R of F 
and such that /(z) is continuous on F. Then 

ff(z) dz = 

The proof is not trivial and is omitted here. Continuity of /(z) on F in 
this case means lim /(z) = /(f ), f on F, z on F or in R. 

We now state some immediate consequences of Cauchy's integral 
theorem : 

A. Let /(z) be analytic in a simply connected region R. For z = a, 
z = 0, in R, 



is independent of the simple path chosen from z = a to z = /3 provided 
the path lies entirely in R. Let the reader verify this statement. 

B. Let /(z) be analytic in a region R bounded by two simple closed 

paths Ci, C V 2 (see Fig. 4.6). Then (p /(z) dz = (D /(z) dz provided 

/ C'i J Ci ' 

both integrations are done in a clockwise sense or a counterclockwise 




Fi. 4.6 




sense. The proof is as follows: Construct the paths AB and EF (see 
Fig. 4.7). From Cauchy's theorem 



Adding yields 



and 



/(,) dz 



/(,) 
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Notice that nothing need be known about f(z) outside of Ci or inside of 
C%. We have assumed f(z) continuous on d and C*. 
C. For Fig. 4.8 let the reader show that 



<j> T% f(z) dz = 



/(z) is analytic inside F and outside 
To, FI, F 2 . Generalize this state- 
ment for the curves F , FI, F 2 , . . . , 

D. Let /(z) be analytic inside a 
simply connected open region R, 
and consider 



dz + <f> r j(z) dz 

FI and F 2 and is continuous on 



= f'f(t)dt 

Jzo 



where zo and z are in R. The path 
of integration is omitted since the 
integral is independent of the path (see A). Hence 




FIG. 4.8 



F(z + Az) - F(z) = 



+ f z f(t) dt 

J z 



dt 



We choose the straight-line path from z to z + Az as the path of integra- 
tion. Then = z + M Az, 0^/x^l, along this path, dt = dp Az, and 



F(z + Az) - 



* "Jo 



f(z + n Az) 



Since f(z) is analytic at z, 



and 
where lei 



f(z + n Az) = f(z) + 
as &z > 0. Hence 



/'(* 
) + 



Az 



r d + Azff(g) r d + ^ r 

Jo Jo Jo 



(4.16') 






It is obvious that 






so that F(z) is analytic in R. 

E. Let f(z) be continuous inside a simply connected open region R 9 
and assume 

F(z) = f'f(t) dt 
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is independent of the path from ZQ to all 2, the path lying entirely in R. 
We show that F(z) is analytic in R. The variable t is simply the complex 
variable of integration. The reader can prove this statement easily 
enough by proceeding as in D. Continuity implies that 

f(z + z) =/(z) +r, (4.16") 

where 17 > as Az 0. In the proof of D it was not necessary to use 
the analyticity of f(z) twice. We could have used (4.16") in place of 
(4.16'). 

F. The fundamental theorem of the integral calculus applies equally 
well to the theory of complex variables. Assuming the conditions stated 
in D yields 

F(z) = /(*) dt F'(z) = /(a) 



Let G(z) be any function such that G'(z) = f(z). Then 

j| (F(z) - G'(*)) = 

and F(z) = G(z) - C. Hence G(z) - C = f* f(l) dt, 
G( Zo ) - C = l'"f(0dt = 

Jzo 

so that 

0(z) -G(*o) = f'f(t)dt 

J 20 



As a simple example, / z 2 dz = 1(0 3 a 8 ) since -7- (|0 8 ) = z 2 . 

^2 

Problems 

1. Show that (7) ^ = 2iri for any simple closed path r, z in the interior of T. 

/ Z ZQ 

Show that u) --- = if ZQ is exterior to F. 

/ Z - Z 

2. Evaluate u) ----- for any simple closed curve F enclosing the circle r = 1. 



Use C above, and write I/ (z* z) I/ (z 1) 1/2. 

8. Construct a function f(z) such that f(z) dz = for all simple closed paths, /(z) 
not analytic. Docs this contradict Cauchy's theorem? 

4. Let/(z, /) be a complex function of the complex variable z and the real variable /. 

Assume J(z t t) and -^-^ analytic in z for U ^ t ^ t\ t and consider 

F(t) = f'f(z, /) dz 
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If, furthermore, -^ (z, t) is continuous in z and t, show that 

vt 

f*G(u)du - F(t) - F(t ) 

i />/.*&* 

6. Let /(z) and #(z) be analytic in a simply connected region R. From 

*f(z)g'(z) + f'(z)g(z) 



show that 

-/()?() - [* g(z)f'(z)dz 

J a 

The path of integration from z a to e = |8 lies in 72. 

4.9. Cauchy's Integral Formula. A truly fundamental consequence of 
Cauchy's integral theorem of Sec. 4.8 is the following formula due to 
Cauchy : Let /(z) be analytic in the simply connected open region R, and 
let F be a simple closed curve in R. Then 



if z a is an interior point of F. The sense of integration around F 
is such that as we move around F the region containing z = a lies to our 
left. The proof is as follows: From Sec. 4.8B we can replace the curve of 
integration F by any circle F with center at z = a, F interior to F. 



Then 



27rz jYz a 2iri /TO z a 

f(z)b(- sin + icos 6) dB 



!_ 
2*i Jo 



o b (cos 8 + i sin 6) 

+ 6 ( cos ^ + i siri ^)1 rf<9 ( 4 - 17/ ) 

since 2 = a + fr(cos B + i sin 6), ^ ^ ^ 27r, is the ecjuation of F , b 
the radius of F . Since /(z) is continuous at z = a, we have 

/[a + 6(cos ^ + i sin 0)] = /(a) + i\ 
where 17 > as fe * 0. Hence 

f sin B)] dB = 2ir/(a) + ** 17 d0 



1 / >2r 

and lim ^- I f[a + 6(cos + i sin B)] dB = /(a) 

^o *v Jo 
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Since the left-hand side of (4.17') is independent of 6, (4.17) must result. 
We can now observe one important consequence of analyticity. To 
evaluate the right-hand side of (4.17), one needs only to know the value 
of /(z) on the boundary F. After the integration is performed, the value 
of /(z) is known in the interior of F. Thus, if f(z) is known to be analytic 
inside F, if /(z) is continuous on F so that Cauchy's theorem holds, then 
the values of f(z) interior to F can be determined if we know only the 
values of /(z) on F. Analyticity is, indeed, a powerful condition. 

Since /'(a) is known to exist, we may hope that /'(a) <<an be obtained 
from (4.17) by differentiating underneath the integral. If this were 
possible, we would obtain 

() d * (4.18) 

v ' 



r (z a) 2 
Let us prove that (4.18) is correct. We have 

dz 



f(a + ft) - /() = 



/(a + h) - /(a) = J^ r _ f(z) dz _ 
h 2iriT (z - a)(z - a - h) 

Since /'(a) = lim - ^ - ^-^ we need only show that 



Ji->0 



- 

- a) (2 - a - h) T (z - a) 2 

to obtain (4.18). Consider 



'. - a)(z - a - ft) r (z - a) 2 ~ T (z - a)*(z - a - h) 
For ft sufficiently small it is easy to see that 



IS 



a) (2 a 

uniformly bounded for all z on F, so that as h > (4.19) holds, which in 
turn yields (4.18). By mathematical induction let the reader show that 



(4.20) 



Equation (4.20) embraces Cauchy's integral formula for /(a) and all its 
derivatives. It is important to note that analyticity of f(z) is strong 
enough to guarantee the existence of all derivatives of /(z). In real- 
variable theory the existence of f'(x) in a neighborhood of x = a in no 
way yields any information about the existence of further derivatives of 
f (x) at x as a. 
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Problems 

1. Let f(z) be analytic within a circle C of radius R and center a, f(z) continuous on 
C. Let M be an upper bound of |/()| on C. Show that 



2. If /(z) is analytic for all z it is said to be an entire function. Use the results of 
Prob. 1 to show that an entire bounded function must be a constant, /(z) is said to be 
bounded if a constant M exists such that |/(z)| < J\f for all z. Hence prove the first 
theorem of Liouville that a nonconstant entire function assumes arbitrarily large 
values (in absolute value) outside every circle with center at z = 0. 

3. Consider the polynomial 

/(z) - aoz n + aiz*" 1 + a 2 z"- 2 + - + a n 
Show that 

I/WI S 

Hence show that for any positive M there exists an 7? such that, for z\ > R, \f(z)\ > M. 

4. Let /(z) be the polynomial of Prob. 3. Assume /(z) 5^ for all z, and consider 
g(z) = l//(z). Why is g(z) an entire function? Use the result of Prob. 3 to show 
that g(z) is a bounded entire function. Since g(z] ^ constant, use the result of Prob. 
2 to deduce that a z exists such that /(z () ) = 0. This is the fundamental theorem of 
algebra (Gauss). Deduce that /(z) has n zeros. 

6. Prove Morera's theorem: If /(z) is continuous in a simply connected region R t 
and if y/(z) dz for every simple closed path in R, then /(z) is analytic in R. 

6. Prove (4.20) for n = 2. 

7. Let <f> n (z), n 1, 2, 3, . . . , be a sequence of functions analytic inside and on 
the simple closed curve T which converges uniformly to <p(z}. Show that <p(z) is 
analytic inside T. 

4.10. Taylor's Expansion. In real-variable theory certain functions 
could be written as infinite series, called Taylor's series or expansion. 
For example, 



1 I I **' I I **' I 

= I + x + jr-. + +.+ 
2! n\ 



\ ** 

= / , 
Lj n\ 



n-O 

T 2 T 3 C 1V 1 ~ 1 J-" 

In (!+)-, -++ + L - +"- 



00 

V (-i) n ^ 2n+i 

"" LI (2n + 1)! 



n 

n-1 



The function /(x) = e~ l/xt , x ^ 0, /(O) = 0, is such that 

/(O) =/(0) -/"(O) = /<>(0) = - - 
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However, this function has no Taylor-series expansion about x = 0. 
We shall find, however, that if f(z) is analytic at z = a, a Taylor-series 
expansion will exist for f(z) at z = a. Remember that analyticity of /CO 
at z = a means differentiability of /CO for all points in some neighborhood 
of z = a. Before proving this result the student should review the 
definitions and theorems involving infinite series (see Chap. 10). The 
definitions of convergence and uniform convergence of series, and the 
theorems regarding term-by-term integration and differentiation of a 
series, apply equally well for an infinite series whose terms are complex. 
We now proceed to the development of Taylor's theorem. Let f(z) be 
a function analytic in an open region /?, and let zo be an interior point 

00 

of R. There exists a unique power series } a n (z ZQ) H such that 



n = 



n=0 



(4.21) 



for all z in some neighborhood of z = z . The series (4.21) for /CO con- 
verges to /CO for all z inside a circle C 
surrounding z = z , and the circum- 
ference of C contains those singu- 
larities of /(z) which are closest to 
z = ZQ. A point zi is said to be a 
singular point of /(z) if /(z) is not 
analytic at z = z\. 

We proceed to the proof. Let C be 
the circle mentioned above. C has 
the property that there is at least 
one point on C at which /(z) is not 
analytic. Moreover /(z) is analytic 
at every interior point of C. Now 
let z be any interior point of C, and 
construct a circle F with center ZQ 

containing z in its interior, F interior to C (see Fig. 4.9). 
Let f be any point on F. From Cauchy's integral formula 

_df 
z 



(4.22) 




FIG. 4.9 



_ 

2 7 r (f - so) - (z - 2 ) 



- (* - .)/(f - *.)] 



Since 
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Z 2o 



< K < I (see Fig. 4.9), we have 

LZJ?V 



r - 

1 



___ 

- (Z ~ 2o)/(f - 2 ( >) 

n-0 



Equation (4.22) becomes 



00 

The uniform convergence of the series y ^ \n+i > ^ ^^ e variable of 

n = 

integration, ^ and z fixed, enables us to interchange integration and 
summation. Hence 



oo 

= ^ a n (z - 20)" 



To show uniqueness, assume 

/(*) = 



n - n = 

for all 2 inside f. For 2 = 20 we have a = 60, so that 



a(2 - 2 ) n = > 6 n (2 - 2 )" (4.23) 

for all 2 inside C. But (4.23) implies 

00 

(Z 2o) > a n (2 2o) n ~ 1 = (2 2o) 
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for all z inside C Y , which in turn implies 

a n (z - so)- 1 = 
for all z inside C with the possible exception of z = ZQ. 

Hence lim Y a n (z - 2o)"~ l = liin Y 6,,0 - 5-0) w 1 

2 Zll ^* Z~2o *-^ 

H 3= 1 f) = 1 

so that i = ?>i. By mathematical induction the reader can show that 
o n = 6 n , n = 0, 1, 2, . . . . Q.E.D. 

Example 4.16. The function ./(z) defined by 

/(z) = K cos // -f /r* sm ?/ 
is easily seen to be analytic everywhere. We have 

/'(z) =7: h t ~ - e r cos y 4- 2e* sin y = /(z) 

nj" dx 

so that by mathemat)(;al induction / (u) (2) = /(z) and / lw) (0) =/(()) = 1. Hence 

00 

Z2 n 
j = e T (cos y + i sin iy) 



We define /(z) to be e* s= e x c iv . We note that, if z is real, /(z) reduces to e x . By direct 
multiplication of the series representing e z and p*i it can be shown that e z e*i e****. 

An easier way is the following: Let the reader 

show that -j- f"~ z = e a ~* for any constant 
dz 




a. Then 

__ ( r - 



Q 



| Z | 



so that e*e a "* is independent of z. Let 
2 = 0, and we have e z e a ~ z ~ e. Now let 
a = 2 -f Zi so that e*e*i * e*+*i. From this 
result we havpe*" cos y -\- i sin y, y real. 
Let the reader show that this result of 
Euler's also holds for y complex. 

Example 4.17. Let us define Ln z as fol- 
lows. Let z be any complex number other 



FIG. 4.10 

than z 0, z r(cos + t sin 0) = re* 9 from Example 4.16, TT < ^ IT. We 
evaluate Ln z s / y along any curve T not passing through the origin or crossing 

the negative x axis (see Fig. 4.10). We can replace T by the path from x 1 to x = r 
followed by the arc of the circle with radius r and center at z = until we reach 2. 
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This yields 

T f r dx . f e ire"? d<f> 

Ln2 = / h / T~ 

Ji x Jo re ttf> 

= In r -f- iO 

= In \z\ -f i Arg 2 

Ln 2 is single-valued and analytic everywhere except at 2 =0, since -7 = -. For z 
real and positive, Ln 2 becomes the ordinary log e x. It is easy to show that 

d* Ln z _ I d n Lnz _ (n - 1)! / j^+i 

r/2 2 z 2 ' ' dz n s n 

( z _ 1)2 ( z __ 1)3 

so that Ln z = fc 1 ) ~ ' ^T ~ " ' * 

Why does the series expansion for Ln 2 converge only for \z 1| < 1? 

If we had not imposed the condition ir < 6 ^ TT, then Ln 2 would not have been 
single-valued. Let the reader show that for this case 



In z = f*Q = l n \ z \ -f t '(Arg z -f 27rn) 



where n is the number of times the path of integration r encircles the origin. If the 
integration is performed in a clockwise fashion, n is negative; otherwise n is a positive 
integer. Let the reader show that 

Ln(-l) -TI 

In 2122 = In 2i {- In 22 -f* zirifi 
Ln xijr<> = Ln xi + Ln x> 2 x\ > 0, x z > 

Ln 2 is called the principal value of In z. We imagine a cut exists along the negative 
x axis which forbids us from crossing the negative x axis. We call the point z a 
branch point of In 2. If we wish to pass from Ln z to In 2, we need only imagine that 
as we approach the cut from the top half of the z plane we have the ability to slide 
under the cut into a new z plane. In this new Riemann surface, or sheet, we have 

ln (1) 2 = Ln 2 + 2iri 
If we now swing around the origin and slide into a new surface, we have 

ln (2 > 2 = Ln 2 + 4rt 
This process can be extended indefinitely. On each Riemann surface 

ln<*> 2 - Ln 2 -f 2*ki 

is a single-valued function. 

Example 4.18. The function f(z) 1/(1 2), z 7* 1, /(I) defined in any way we 
please, has for its only singularity the point 2 = 1. The Taylor-series expansion of 
/(z) about z = must converge for |z| < 1. Indeed 

f(z) (4.24) 

n~0 

converges for |z| < 1 and can be used to represent 1/(1 z) for |z| < 1. Let the 
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reader show that 



a 



1 d" f 



1 \l 
1 - zj Uo 



1 



n = 0, 1, 2, ... 



nldz* \l 

The series expansion of 1/(1 z) about the point z = i/2 should converge for 

i 



since the point z = ?/2 is \/5/2 units 
from z = 1 . We write 




1 
I -z 



I 



nb ! ( 



1 - i/2 

* 



The circles of convergence, 2] = 1 and 
: _ , overlap (see Fig. 411). 

In this shaded region of overlapping, (4.24) and (4.25) converge to the same value of 
1/(1 z). Equations (4.21) and (4.25) are said to be analytic continuations of 
each other. 

Problems 

1. Obtain the Taylor-series expansion of sinh z s= (e* - e~*)/2 about 2=0. The 
same for cosh z s (e* -f e~ f )/2 about z 0. Show that cosh 2 z = 1 -f sinh 2 z, 

~j- sinh z cosh z, -7- cosh z sinh z. 



2. Show that Ln (1 + 2) 



(-1)" 



00 w 

B. Show that N (z n /n\) converges for all z. Multiply the series ^ (z n /w') and 



n-0 



00 

I 



to obtain e*e*i 



4. If one defines tan" 1 2 by tan" 1 z = f , > what difficulties occur? How 

can one make tan"" 1 z a single-valued function? 

5. Show that e 2rtt * 1 if n is an integer. If e +a *, show that a = 2irni, n an 
integer. 

6. Define w \fz as that function w such that w* = z. If z = re**, w? = pe i<p t show 
that p - \/7, ^> - r/2. Since z - re ( *+ 2T) , show that ? 0/2 + r also. For z ^ 0, 
show that w is double- valued. Construct a Riemann surface so that w is single- 
valued. Is the origin a branch point? 

7. If w - Ln z, show that z - e-. Hint: Ln * - In \z\ -f i0, " - <aU!+*. 
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8. Define w * sP, a complex, by the equation w z" e" 1 "'. How can one make 
IP single-valued? Show that for this case -T as** 1 . 

00 

9. Let /(*) be analytic for |z| < JR,/() - Y a* n . Show that, for r < R t 



n-0 



n-0 



10. Let /(z) be analytic in a simply connected region 7? bounded by a simple closed 
path T, f(z) continuous on r, so that Cauchy's theorem applies. Let be an interior 

00 

point of R so that f(z) ^ a n (z z ( >) n . Let C be a circle with center at z<> and 

n = 

radius r, C inside r. Show that |o | 2 + \Qi\*r* + |a 2 | 2 r 4 -f |a |' if it is 
assumed that |/(zo)| *= 1/00 1 f r all z m R> Hence prove the maximum-modulus 
theorem, which states that |/(zo)| ^ |/()| for all z in /if implies 2 on T if/(z) 5^ constant. 

4.11. An Identity Theorem. Analytic Continuation. We have seen 
that, if a function f(z) is analytic? at a point 20, a Taylor-series expansion 
exists. If one desires, then, one could define the class of analytic func- 

00 

tions as the totality of series of the form N a n (z 2 ) n with nonzero 

n = 

radii of convergence. Some of the series would be analytic continuations 
of each other. Thus one could start with a particular analytic function 



f(z) = a n (z ZQ) H which converges for all z such that \z 2o| < r 7* 0. 

n-O 

Now choose a point, z\ t inside this circle. Since /(z) is analytic at z\, we 
can find a series expansion for/(z) 

00 

in the form Y b n (z Zi) n which 

n=0 

converges for all z such that 
\z - zi\ < ri ^ 

This new region of convergence may 
extend beyond the original circle of 
convergence (see Fig. 4.12). 

This process can be continued. p 

Each series is an analytic continua- 
tion of its predecessor, and conversely. All of them represent the original 
f(z), which has now been extended to other portions of the z plane. One 
might naturally ask, if a point z = f is reached by two different paths of 
analytic continuation, do the two series representations thus obtained 
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converge to the same value in their common region of convergence? We 
cannot answer this question until we prove Theorem 4.2. 

THEOREM 4.2. Let f(z) and g(z) be analytic in a simply connected 
open region R. Assume f(z) = g(z) for a sequence of points z\, zz, . . . , 
z n , . . . having z as a limit point, z t , i = 0, 1, 2, . . . , n, . . . , in R. 
Then /(z) s g( z ) in R. 

The proof proceeds as follows: First note that f(z n ) = g(z n ) so that 
/(zo) = lim /(zn) = lim g(z n ) = 0(z ). Moreover, since /(z) and 0(3) are 

n n > 

analytic at zo, 



g(z) = b t (z - 

fc-0 

and /(ZQ) = 0(20) implies ao = />o. Hence 

(Z n - Zo) y a fr (2 n - So)*" 1 = (Z n ~ So) / 



Ar-1 Ar-1 

for n = 1, 2, 3, . . . . This implies 



for n = 1, 2, 3, .... Hence 

oo 

lim Y a t (2 B - zo)*" 1 = lim b k (z n 



which implies ai = 61. By mathematical induction the reader can show 
that a n = 6 n , n = 0, 1, 2; . . . . This shows that /(z) s= gf(z) for some 
neighborhood of ZQ. This neighborhood (a circle) extends up to the 
nearest singularity of f(z) or g(z). Now let f be any point of J?, and 
construct a simple path To from z$ to f lying entirely in /. Let F be the 
boundary of R (see Fig. 4.13). Call the shortest distance from F to F, 
p. The radius of convergence of f(z) about z = 20 is ^ p. Why? Now 
choose a point Zi on F interior to the circle of convergence of /(z) and 
g(z) about ZQ. Since /(z) and g(z) are identical in a neighborhood of z\ y 
we easily prove in exactly the same manner as above that /(z) = g(z) 
for some circle of convergence about z\ whose radius is greater than or 
equal to p. Let the reader show that z = f can be reached in a finite 
number of steps. When f is interior to one of the circles of convergence, 
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/(f) 0(f)> which proves the theorem. It is important to note that at 
each point zo, Zi, 22, , Zk the actual given function /(z) and its deriva- 
tives are used to obtain the Taylor series expansion of /(z). Analytic 
continuation is not used since we are not at all sure that the value of 




FIG. 4.13 

/(z) at z = f would be equal to the value at z = f obtained by analytic 
continuation. That this is true for a simply connected region requires 
some proof. This is essentially the rnonodromy theorem, whose proof 
we omit. The formal statement of the monodromy theorem is this: 

00 

Let R be a simply connected region, and let f(z) = ^ a n (z Zo) n 

w0 

have a nonzero radius of convergence. If f(z) can be continued analyt- 
ically from z along every path in 7?, then this continuation gives rise to a 
function which is single-valued and analytic in R. 

We now state some consequences of the identity theorem : 
(a) The identity theorem holds for a connected region. 
(6) Let f(z) be analytic at z = z . Let zi, z 2 , z 3 , , z n , . . be a 
sequence of points which tend to z as a limit such that/(z n ) = 0, n = 1, 
2, 3, .... Then f(z) = in a neighborhood of z ZQ. 

(c) Let /(z) be analytic at z , f(z) ^ constant. A neighborhood, N, 
of z = zo can be found such that /(zi) ^ /(z ) for z\ in N, z\ 7* ZQ. 

(d) Let /(z) and g(z) be analytic in an open connected region R. 
Assume / (n) (zo) = ^ (n) (^o), n = 0, 1, 2, . . . , z in #. It can be shown 
that /(z) ss gr(z) in 72. 

Problems 

1. Prove (a). 

2. Prove (6). 

3. Prove (c). 

4. Prove (d). 
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5. Consider f(z) = sin 



1 



\z\ < 1. Show that f(z) has an infinite number of 



1 -* 
zeros in the region |z| < 1. Does this contradict (6)? 

6. Let ?(x) be a real-valued function of the real variable x, a ^ x b. Show that, 
if it is at all possible to continue <p(x) analytically into the z plane, the continuation is 
unique. Hint: Assume /(z) and g(z) analytic for z real, a ^ z 6,/(x) - ^(3) g(x). 

7. Use the results of Prob. 6 to show that 



>< - i s 



n-0 



is the only possible definition of e*. 

8. Consider z a ^ aLlw . Define ft as the operator of continuation by starting at 
2 and returning to z, the path of continuation encircling the origin. Show that 



Hint' Ln z In |z| -f- t'0. After encircling the origin In 2 * In |z| + ^0 + 2iri. 

00 

9. Let /(z) be analytic at 2 = Zo so that /(z) =* y a(2 2o) n converges for 

n-O 

|z 2o| < /2. Show that/(z) has at least one singular point on the circle z = Zo -f Re* e , 
0^5^ 2r. Hint: Assume /(z) analytic at each point of the circle, obtain a circle 

of convergence at each point, apply the 
Heine-Borel theorem, and extend the radius 
of convergence, a contradiction. 

4.12. Laurent's Expansion. A 

generalization of Taylor series is due 
to Laurent. Let /(z) be analytic in 
an annular ring bounded by the two 
circles KI, Kz with common center 
z = a. Nothing is said of /(z) inside 
KI or outside K 2 (see Fig. 4.14). 
Now let z be any point in the annular 
ring, and construct circles Ci and 
Cz with centers at z = a such that 
z is interior to the annular ring 
lying between Ci and C 2 . Moreover 

C\ and C 2 lie in the annular ring between K\ and K*. From Cauchy's 

integral formula and theorem 



(4.26) 




FIG. 4.14 



This result can be obtained easily by using the method found in Sec. 4.8B. 
Now 



- 2 



/(f) 



_ _ 

(f - O) - (2 - O) r (f - 0)[1 - (2 - o)/(f - O)] 



Since 
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Z ~~~ CL 



- - 
f a 



< r < 1 f or f on C^ we write 



1 - (z - a)/(f - a) f ~ <* 

n0 

and interchange the order of integration and summation because of the 
uniform convergence of the series. This yields 



(42?) 

-B^8^n -*',*... 

For the second integral 



_ 
c, f - 2 7c, (f - a) - (2 - a) 



(z - o)[ 1 - (f - o)/( - a)] 



Since 



a 



< s < 1 f or f on t'i, we write 



1 - (f - a)/( 2 - a) Zv V - a 
Interchanging integration and summation yields 



o. = 2. Ci (r - a)-'/(f) df n = 1, 2, 3, . . . 

CO 00 

Hence f(z) = Y a n (z - a) n + V a_ n (2 - a)~ n 

n=0 n-1 

= Sj (z - a) + 2 fc a) 

00 00 

where Si(z a) = Y a n (z - a) n Y a^ n (z a)~ n = S*(z a) 



n-l 



Since (2 a)~ (w-f !) /(2) and (z a) n ~ l f(z) are analytic in the annular ring 
between KI and K^ we can choose any path of integration to calculate the 
a n , n = 0, 1, 2, . . . , provided the path F encircles the point z * a 
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exactly once. Hence we can write 



/(z) = > a n (z - a)" 

(4.29) 
/(f ) df 



(f - a) 



n-O +1 +2 

^ - U, 1, J, . . . 



Equation (4.29) is the Laurent expansion of /(z) valid for all z in the 
annular ring between K\ and K 2 . 

We leave it as an exercise for the reader to show that &\(z) converges 
for all z inside 7f 2 and that $2(2) converges for all z outside K\. The 
common region of convergence is the above-mentioned annular ring. 

Example 4.19. Consider /(z) = l/(z l)(z 2). Certainly /(z) is analytic for 
all z such that 1 < |z| < 2. Let us find the Laurent expansion for/(z) in this region. 
It is not necessary to find the a n of (4.29). We wish to write /(z) as the sum of two 
series, one series converging for |z| < 2, the other for |z| > 1. We write 

1 11 1111 



(-l)(s-2) z-2 z-l 21-Z/2 zl-l/z 

oo ac 

, _ V ^L- V 1 

L, 2 n+l Li z n 

n^() n = 1 

The first series converges for \z\ < 2 and the second for |z| > 1. Let us check this 
answer by using (4.29) to find the er r ,. 



where T is any simple path enclosing the origin and lying between the circles |z| = 1, 
|z| - 2. We write, for n ^ 0, 

1 = __^_+_A_++^+ . . . +_!_ MSO^ 

^^ 2 ^ l ; 



IIence _L. ^ ^^--^L,.- . A + C Why? 

To find A, multiply (4.30) by f - 1, and then let f -^ 1. We obtain - 1 = A. It is 
easy to see that B ** l/2 n+1 . To find C, multiply (4.30) by f, and let r -^ . We 
have 0-A+JS + Cso that C - 1 - l/2 n+1 . Hence 

^ - 53 



Let the reader show that a n = 1 for n < 0. 

4.13. Singular Points. We had previously stated that z is a singular 
point of f(z) if /(z) is not analytic at 20. If, moreover, f(z) is analytic 
for some neighborhood of 2 with the exception of 2 , we say that 20 is an 
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isolated singular point. In this case the Laurent expansion of f(z) con- 
verges for < \z Zo| < r, where r is the distance from ZQ to the nearest 
singular point of /(z) other than z itself. We distinguish now between 
3 types of functions which have isolated singular points. 

00 

Case 1. The series f(z) = ) a n (z Z O ) M is such that a n = 0, for all 

L-i 
w-0 

n < 0. We need only redefine f(z) at z to be a and/(z) becomes analytic 
at z = zo. A singularity of this type is said to be a removable singular- 

00 

ity. Thus f(z) = Y (z n /n\) for z ^ 0, /(O) = 2 can be made analytic 

n~0 

00 

at z = by defining /(O) = lim T (z n /nl) = 1. 

*-+o ^ n 

n=0 

Case 2. All but a finite number of the a n , n < 0, vanish. In this 
case we say that ZQ is a pole of /(z). We write 



' w (z - *)" ' 
and zo is said to be a pole of order n. If 



z = ZQ is said to be a simple pole of /(z) . 

The reader can easily verify that |/(z)| becomes unbounded as 
if zo is a pole. A pole is likewise called a nonessential singular point. 

Case 3. An infinite number of the a n , n < 0, do not vanish. In this 
case we say that zo is an essential singular point. For example, 



has an essential singularity at z = 0. An important property of an 
essential singularity is Theorem 4.3, due to Picard, which we state with- 
out proof. 

THEOREM 4.3. In any neighborhood of an essential singularity a 
single-valued function takes on every value, with one possible exception, 
an infinity of times. 

Let us consider e 1/z at z = as an example. Are there an infinite 
number of z in any neighborhood of z = for which e l/z = e 60 ? The 
answer is "yes"! Since e 2Tnt = 1, we have e 1/z +* rni = e 50 ; so the equality 
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holds if l/z n + 2wni = 50, or z n = 1/(50 2irni). Let n = 1, 2, 
3, etc., and we have an infinite sequence of z n which tend to z = as a 
limit and such that e 1/Zn = e 60 . Let the reader show that the same applies 
to e l/ * = i. The exceptional value stated in Picard's theorem is zero. 
There is no z in any neighborhood of z = for which e 1/z = 0. Let the 
reader show this. 

We shall see in Chap. 5 that the point at infinity will play an important 
role in the development of differential equations in the complex domain. 
First we say that the point of infinity is an isolated singular point if f(z) 
is analytic for all z such that \z\ > H > 0. We then replace z in f(z) 
by 1/t and investigate /(I//) at t = 0. The nature of f(\/t) at t = is 
defined to be the nature of f(z) at z = o> . 

Example 4.20. (a) /(z) - 1/z, /(I/O - t, so that, if we define /() =0, /(z) is 
analytic at z . (6) /(z) * z 1/z, /(I//) - \/t t, which has a simple pole at 

00 

t ** 0. We say, therefore, that /(z) has a simple pole at infinity, (c) /(z) = N > 

7J-0 

00 

Zl 
? - Since /(I//) has an essential singularity at t we say that f(z) 

n-O 

lias an essential singularity at z = w . 

Problems 

1. Find the Laurent expansion of f(z) = l/(z 2 -f l)(z - 2) for 1 < |z| < 2. 

2. Find the Laurent expansion of /(z) = -^ ^pry? 4. o\ ^ or < M ^ * ^ or 
1 < |z| < 2; for |z| > 2. 

8. Show that the coefficients in the Laurent expansion (4.29) are unique. 
4. Find all the roots of e* i. 

6. Find a function /(z) which has a simple pole at z = 0, z = 1, and z > . 
6. Define cosh z (e* + ~*)/2. Show that 



(z + *) = fl + Y a 

n rl 

a n = rt~ / cos 

>"* JO 



cosh 

n rl 

cos 9) 



for |z| > 0. 



7. Let f(z) have a simple pole at z ZQ, /(z) = ^ + > a(z Zo) n . We call 

Z ZQ / 
n-0 

a_i the residue of /(z) at z z . Show that o_i lira (z zo)f(z). 

2-+ZQ 

8. Let/(z) have a pole of order A- at z z . Show that the residue a_i is given by 



ffl - j 



. Find the residues of f(z) - *V(c + *)< at its poles. 
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10. Find the residue of /(z) ~l+z+z*B,tz~ . 

11. Let/(z, t) - e^'X'- 1 /*). Show that the Laurent expansion of f(z, t) for |*| > is 



/(Z, t) - 

n - * 

where ,/ = (-!)./(- = ~ cos (w0 - t sin 0) 



12. Let /(z) be analytic in the infinite strip given by a < Im z < a, a > 0. 
Assume /(z) f(z + 2ir). By use of Laurent's theorem show that 



13. Let /(z) be an entire function (analytic everywhere) with a pole of order n at 
z oo . Obtain the Laurent expansion of /(z) for ^ |z| < <*> , and show that 

/(z) = a -f Qiz + + a n z n 

14. Let /(z) be analytic everywhere with the exception of a finite number of poles 
at zi, z 2 , . . . , zt and a pole at z >. The order of the pole at z is a,, z 1, 2, 
. . . , n. Consider 

<^(z) - (z - z,)i(z - z 2 )s - (z - z*)*/(z) 

and show that/(z) is a rational function, the quotient of two polynomials. The result 
of Prob. 13 is useful. 

15. Let p(z) have simple poles at the finite points z , 17, f. Also assume that 
2 z*p(z) is analytic at z oo . Consider 



and show that P(z) has a pole at most of the order 2 at z = oo. Hence show that 

A _J*_ C 

z \ z 17 z f 

where A, B, C are constants. 

4.14. Residue Theorem. Contour Integration. Let z = z be an iso- 
lated singular point of /(z). For /(z) single-valued and analytic in the 
region < \z 3o| < R we have the Laurent expansion 



Let F be a simple closed path encircling z = 2 lying in the region < 
\z - *o| < ft. Then 
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00 

<I>/(z) dz = < Y a n (z z ) n dz 

ao 

V J! 



since 



(n (z zo) n dz = for n ?* 1, and (p = 2wi. The inter- 

y r T Y z ZQ 

change of integration and summation can be justified. We leave this 
as an exercise for the reader. The constant a_i is called the residue of 
/(z) at z = z . 

Let the reader prove Theorem 4.4. 

THEOREM 4.4. Let R be a simply connected open set, and let T be a 
simple closed path in R. Let /(z) be single-valued and analytic in R 
with the exception of a finite number of isolated singular points. Then 

(f) /(z) dz = 2iri (sum of residues of /(z) inside T) (4.31) 

The residue theorem (4.31) is highly useful in the evaluation of real 
definite integrals. We discuss some examples. 

dx 



Example 4.21. Let us evaluate / r- 




We deal naturally with 

/ = r4-., 



Since /(z) 

lim^ -r- 
z-+i 1 + J 



Now 



Now f(z) has a simple pole at z i. To 
apply the residue theorem, we look for a 
path containing z = i in its interior. At 
the same time we desire that part of the sim- 
ple closed path be part of the real axis. We 
choose as T the straight-line segment ex- 
tending from x = R to x ** R and the 
upper semicircle \z\ = R > I (see Fig. 4.15). 
1/(1 -f 2 ) has a "simple pole at z i t the residue of /(z) at z = i is 



= lim 



rfx 



^ (see Prob. 7, Sec. 4.13) Hence 



f*Rie**de _ . 1 

+ Jo i + iw " Jrl 2i " 



(4.32) 



since \R*e** e -f 1| 
infinite, (4.32) becomes 



since lim R*/(R* - 1) - 0. 



I f- M* e M </?/"' de ^ R * 

I 7o 1 + R*e** 9 - K Jo \l -f R*e** 9 \ = R* - 1 

- 1 . fl - 1, fl > 1. If we allow # to become 



dx 
- 1 +a 
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The method used above is known as contour integration. This method can be used 
to advantage to evaluate integrals of the type / f(x) dx. The choice of the closed 

J 80 

contour of integration is not always apparent. One does not always pick a semicircle 
as part of the contour. 

Example 4.22. Let us evaluate / t ^ dx, a positive. Instead of dealing with 

(z sin z)/(z 2 + a 2 ) we consider /(z) = ze"/(z 2 -f a 2 ). Remember that 

e is = cos x -f i sin x 

which introduces the term sin x. e lt is much easier to handle than sin z. As the con- 
tour of integration let us use that of Example 4.21 with R > a, since f(z) has a simple 
pole at z = ai. We apply the residue theorem and obtain 

f R xelt j . f ze " dz o /<we* (a \ - fl 
/ T ; o dx -f / -T- i * = 27T* I . 1 = tire a 
J-Kx 2 +o 2 ./cz 2 +a 2 \ 2at / 

where C is the upper semicircle of Fig. 4.15. 

On C, z = /te', dz = flie'' de, so that -f^rA = ^X? , le 4^' We see that it 

1 2 Z H- a 2 /t z + a 2 

will be difficult to determine lim / Z f . * We abandon the semicircle and choose 
fl-oo JC z* -f a 2 

as our contour of integration the rectangle with vertices at ( 72, 0), (/?, 0), (/2, 
ft -f #0, (-R, -R + /?). Then 



riz xe > r ^w}^^i_dy 

J-R x* + o 2 + Jo (ft + ?//) 2 + fl r 



> dx 



/ft xe tf dx 
2 "_, j so that 
ft 3" i~ fl 



B x 2 + a 2 "" " 
f T sin x efo 
and / ~~~jl sT = TC " 

by equating the imaginary parts. 

Example 4.23. As an illustration of the residue theorem consider 



/(Z) = (2 - 2o)" Ot(z - 2o)* Oo ^ 

A 

We say that z = z is an nth-order zero of /(z). We can also write /(z) * (z 2 )*VC0 9 
^(zo) ^ 0, tp(z) analytic in a neighborhood of z . Let the reader show that there 
exists a neighborhood of z such that ^>(z) 5^ for all z in this neighborhood. Then 



(z) - n(z - z ) w -V(2) + (z - zo)V(z) 
- (z - Zo) 



We see that z 2 is an (n l)st-order root of /'(z) * 0. Moreover 

-f (z - 



/ x - 
g(z) - 



/(z) (z - ZoMz) z - z 
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so that z Z Q is a simple pole of g(z), since v(z Q ) 9* 0. Hence for any simple closed 
path F surrounding ZQ for which <?(&) ^ for all z inside r we have 



Similarly, let /(*) 





(z - 



a* (* - so)* so that 



^>(z) analytic in a neighborhood of z = z , ^(z ) a-m 5^ 0. Let the reader show that 
/'(z)//(z) * w/(z z ) -f v'(z)/<(>(z). m is called the order of the pole at z = 2,,. 
The reader should now be able to prove Theorem 4.5. 

THEOREM 4.5. Let f(z) be analytic in a simply connected set R with 
the possible exception of isolated singular points. Let C be a simple 
closed path in R enclosing a finite number of these isolated singular 
points which are poles, f(z) j on C. Then 



(4.33) 



where N is the number of zeros of /(z) inside C, the order of each zero 
counted in determining JV, and P is the number of poles, the order of 
each pole counted in determining P. 



Problems 



1. Show that 



2. Evaluate 



x*dx 

-oo (Z* + 0*) 8 

x*dx 



f - x 4 -f 5z 2 -f 4 
3. Integrate e*/2 around the rectangle with vertices at z R, R Ri indented 



at the origin, and show that / dx TT, see Fig. 4.16. 

J oo X 



-r Or 
FIG. 4.16 



R 



4. Prove (4.33). 

5. Let /(*) - aoz n -f 



"' 1 -f 



-f On, and let C be a circle \z\ - r such that 
1/00 1 > 1 for \z\ > r. Show that ~~. j) f-jj- dz - n. From (4.33) state a theorem 
concerning f(z) - 0. 
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6. Consider z e* 9 cos B + i sin 6. Show that cos 9 - ?(z -f l/), 
sin - (l/2i) (* - 1/2). 

Use this result to evaluate / =. - - 
Jo 2 + cos 

7. Integrate e~** around the rectangle whose sides are x ft, x #, y 0, 
y b > 0, and show that 



1 cos 2bx dx 



e-** dx 



Then show that 

/ e~ z * dx = ( / db I e~** uos 2bx dx) = 
from (6.78). 

8. For < a < 2 show that f -^~~ t dx = TT ^2 sin ~\ 

f 111 (1 ~f- 3*) 2 

9. Show that / ~ . ~- dx = v In 2. 

Jo 1 -h # 2 

10. For a > show that f ^ 

Jo x 4 



+ a 4 2\/2a s 



11. For a > 0, 6 > show that 



r(o -f 26) 
i& 3 (a -f 6) 2 



12. For a > 6 > show that 



_ . _^_JL^ /^ _ CL\ 

?> 2 ) a 2 b* \ 6 a / 



4.15. The Schwarz-Christoffel Transformation. Let us consider a 
closed polygon in the w plane. We 
assume that the polygon does not 
intersect itself (see Fig. 4.17). 

We wish to find a function 
z = F(w) or w = f(z) which maps 
the polygon into the real axis of the 
z plane. Let PI be mapped into 
(xi, 0), Pz into (rr 2 , 0), etc. As we 
move along the polygon in the w Q 
plane, we move to the right along 
the x axis in the z plane. Now if 
such a transformation exists, then dw = f'(z) dz and arg dw = arg/' (2) + 
arg dz. Along the x axis, arg (h = arg <Lr = 0, so that 

arg dw - arg/' (2) 
along the polygon. Hence 

A arg dw = A arg f'(z) 




FIG 4.17 



164 



ELEMENTS OF PURE AND APPLIED MATHEMATICS 



along the polygon, where A represents an abrupt change in the value of 
arg dw. This occurs, for example, at PI. As we turn the corner at PI, 
arg dw changes abruptly by an amount ai^r, 1 < a < +1. Now con- 
sider f'(z) = (z zi)- ai , i real, so that arg /'(z) = i arg (z zi). 
Thus 

A arg/'(z) = aiA arg (z Zi) 

= ai[arg z>Il (z zO arg f<Zl (z Zi)] 

= -ai(0 - TT) = iri 

This suggests that the transformation we are looking for satisfies 
dw 



A(z- xi)*(z - 



The reader can easily verify that 
A arg dw = A arg/' (2) 
It can be shown rigorously that 

w(z) = A (z - xi)*(z - 



(2 - z n )- a (4.34) 



k = 1, 2, . . . , n (4.35) 



-" (*- x n )-n dz + B (4.36) 

is the required transformation. A and B are constants in the Schwarz- 

Christoffel transformation given by 
(4.36). It can also be shown that 
the interior of the polygon maps 
into the upper half plane of the z 

n 

axis. Since 7 a, = 2, the reader 

ffi 

can easily verify that w(<x>) and 
w(oo) exist. By integrating 
(4.34) around the closed path of Fig. 
4.18, allowing the radii of the small semicircles to tend to zero, and allow- 
ing the radius of the large semicircle to become infinite, the reader can verify 
that w( oo ) = tt?( oo ) , not neglecting the fact that 1 < a t < + 1 , i = 1 , 
2, . . . , n. 

If we desire that x n be the point at infinity, we define z = z n 1/f - 
When z = x, f = . Moreover dz = (1/f 2 ) df so that (4.36) becomes 




(f - Oi)-(f - o,) 

I (z - ai)- a >(* - a 2 )- a ' 
Jo 



(f - an- 



(4.37) 
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since ^ a t = 2. The a t , i 1, 2, . . . , n 1 are constants. Equa- 

i-i 
tion (4.37) is exactly of the form of (4.36) with the term x n = > omitted 

Example 4.24. We consider the polygon of Fig. 4.19. 

Two of the vertices of the polygon are at ir/2 + *>i t ir/2 +001. Let us map P 
into ( a, 0) and Q into (a, 0) of the z plane. At P, a\ ~ , at Q, , so that 
(4.36) can be written 



w(z) 



A (z + a)-l(z - a)~* dz + B 
yo 



\ z * - a 2 
The transformation z = a sin w, dz = a rot- 



M dit, 



oi cos w, yields 

rt 

B 



, 

sin" 1 
i a 



From the conditions (z = a, 
(z a, w =* 7T/2) we have 



1 

sin" 1 - 
a 



2 = a sin 10 (4.38) 



If we let 10 = ir/2 + *#, then 2 = a 

Similarly if we let w = ir/2 -f- iR, we see that 2 

tion (4.38) unfolds the polygon. For z = j 4- t//, ti? 



V 




1 
1 
1 








=-! 




IT 

tt-y 


P 




v-0 


^ t r 


r^=! 


*" Q w 


i 


?IG. 4.19 


(iR) = a cosh /?. As R f 2 > 



oo . 

as /^ . The transforma- 
w -|- w, (4.38) becomes 



-f iy = a sin (u + w) 
x = a sin w cosh t; 
V a cos i/ sinh w 



a sin u cosh v -{- KI cos a sinh t; 



Hence x 2 /(a 2 sin 2 u) y*/(a? cos 2 w) 1 so that the straight lines M Uo ^ 

,/ 

map into hyperbolas in the z plane. Similarly the straight lines v v* map into 
ellipses in the z plane. 

Example 4.25. The Schwarz-Christoffel transformation is very useful in solving 
certain problems in two-dimensional electrostatic theory. First let us note that for 
an analytic function, w /(z) w(z, y) -f- iv(x, y), we have V 2 u and V 2 t; 0, 
(see Prob. 1, Sec. 4.6). If f(z, y) is the electrostatic potential, then V*v 0. The 
lines of force, w(2, y) =* constant, are at right angles to the equipotential curves, 
v(x> y) constant. But for the analytic function, w u + ^> we know that the 
curves w(x, y) * constant intersect the curves v(x, y) constant at right angles 
(see Prob. 1, Sec. 4.6). Thus, to solve a two-dimensional electrostatic problem, we 
need only find w = /(z) * u -h iv such that v(x t y} satisfies the electrostatic boundary 
conditions. Once this is done, E = Vv yields the electric field. Moreover 



m - i^. 



du? 
d2 

Now let us consider an infinite line charge q whose projection in the xy plane is the 
origin. It is easy to show that V 2 w - 0, u - u(r), r - (z 2 + y)i, has the solution 
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u(r) 2q In r. If we consider w 2qi In z, we have 

u + it> 2gi In (re**) i2q In r 2q$ 

so that v 2g In r is the required potential. If the charge were placed at z , then 
u> * 2#t In (z ZQ). A similar line charge, q, placed at ZQ yields 

w 2<7? In (2 2 ) 
The field due to both charges (ran be obtained from 

w * 2qi In (z z ) 2^i In (z 2o) 



= 2<?i In 



z ZQ 



(4.39) 



If we consider a point on the x axis, z * x, then 



t In 



x ZQ 
x -z 



Let the reader show that 2<n In is real, so that v for the x axis. Hence 

x z 

(4.3 )) will yield the electric field for an infinite grounded plane (the x axis) due to an 
infinite line charge placed at ZQ. Remember that the imaginary part of w of (4.39) 
satisfies Laplace's equation and satisfies the boundary condition v = when y 0. 

Now we consider a more complicated example where we shall make 

use of the Schwarz-Christoff el trans- 
formation. Consider two semi- 
infinite grounded planes intersecting 
at an angle ^>. We find w(z) for an 
infinite line charge q placed at ZQ 
(see Fig. 4.20). First we find the 
transformation which maps the pol- 
ygon AOB into the line v(x, y) = 0, 
which is the u axis. We need only 

FIG. 4.20 * apply the Schwarz-Christoff el trans- 

formation (4.36) with z and w inter- 
changed. We map 2 = into w = 0. a, at z = 0, is easily seen to be 
(w ^>)/T = 1 <?/*, so that dz = AoW'*" 1 dw and z = Aw*'*, w = Bz T/ *. 
The charge at ZQ maps into WQ = BZQ*'*. The complex function for an 
infinite grounded plane with charge at WQ has been worked out above. 
It is 




w 



so that W - U + iV 2qi In % ** 

Z """ " ZQ 

V(x, y) is the required potential function. 



(4.40) 
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As a special case let v> = *, 20 = r &- Equation (4.40) becomes 
U + iV - 2qi In LZ_ 

Z T" Tot 

= 2gz[ln (z r t) In (z + r t)] 
= 2<^'[ln \z t>o| + * arg (2 t> )] 

Hence V = 2q[ln \z ir \ In \z + ir \] 

= g In 2 . / . V 2 

For t/ 0, F = g In 1 = 0. 

Problems 

1. What electrostatic problem is solved by the transformation (4.38)? 

2. Map the rectangle of Fig. 4.21 with two vertices at infinity into the real axis of 
the z plane. Show that w (a/V) cosh" 1 z. 

3. Find the electric field due to a charge q placed midway between two infinite 
grounded planes (see Fig. 4.22). 



(w) 



B(o, a) 



FIG. 4.21 



-1 



(2) 



1 
7T> 



o c 



B(o, h) 



FIG. 4.22 



FIG. 4.23 



-1 



C 



4. Map the polygon of Fig. 4.23 into the real axis of the z plane, and show that 
w (&/r)(Vc* - 1 + cosh" 1 z). 

5. Consider a closed curve C given parametrically by x - /(O, I/ - *>(0, a ^ * 6. 
Consider the transformation z x -f iy /(to) -f tV(to). Show that under this 
transformation the closed curve C of the z plane maps into the real axis of the w plane. 
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4.16. The Gamma Function. The Beta Function. It was Euler who 
first obtained a function of the real variable x which is continuous for x 
positive and which reduces to nl when x = n, a positive integer. We 
consider 

e-'t*- 1 dt (4.41) 



The existence of the integral depends on the behavior of the integrand at 
t = and at t = <x . We first write 

T(x) = f l e-'t*" 1 dt + I" e-tt*- 1 dt 

For x > we have that e~ l t x ~ l behaves like t x ~ l for t near zero. It is 

well known that / t x ~ l dt exists f or x > since 
Jo 



lim t x ~i dt = lim I - - - = - " A 

^o 7 *-+o V x/ * z > 

To investigate the second integral, we note that 

e -tl*-\ - -</ 2 (<r-'/2^-i) ij m e-/2j*-i = o 

Hence for t ^ 7 T there exists a constant A such that le"^" 1 ] < Ae~' /2 . 
Since / Ae~ 1 ^ dt exists, of necessity / e^t^ 1 dt exists. We now inte- 
grate (4.41) by parts and obtain 

t x '"* f" t x 
J ' / e- 1 - 

Jo x 



<-00 

+ / <r iV - dt x > 
<-o 



-if 

x Jo 



er'Fdt = r(x + 1) (4.42) 



Thus T(x + 1) = xT(x). If x is a positive integer, x = n, we have 
F(n + 1) = nF(n) = n(n~ l)T(n 1) and, by repeated applications, 
we obtain 

p(n + 1) = n(n - l)(n - 2) 2 ir(l) = n! (4.43) 

since T(l) = J* e~* dt = 1. 

The gamma function of a complex variable z is defined by 

e-'t'- 1 dt (4.44) 

t*~~ l is defined as ^~ 1 = e ln *'~ l = c ( *~ 1)Ln ', where Ln t is the principal value 
of In t. By the same reasoning as above one deduces that T(z) exists for 
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We can write 



= / e-'t*- 1 dt + 
Jo Ji 



T(z) = e-'t*- 1 dt + e-'t*- 1 dt 



V(_lUfc+-l 
^ converges uniformly on (0, 1) for Rl z > 0, we can 

j_^ /C ' 

t = 

interchange the order of integration and summation. Hence 



t/: 



/"" V^ (_ i)* 

Now / c~ l t*- 1 dt exists for all z, and > rA , ,v converges for all z 

k~0 

other than z = 0, 1, 2, . . . , n, . . . . Hence T(z) defined by 
(4.45) is analytic everywhere except at z = w, n = 0, 1, 2, . . . . At 
2 = ft, r(z) has a simple pole. r(s) defined by (4.45) is the analytic 
continuation of T(z) as defined by (4.44). 

The function <p(z) = T (z)T( -z) has simple poles at z = 0, 1, 2, 3, 
. . . and is analytic elsewhere. The function I/ (sin vz) has simple poles 
at z = 0, 1, 2, . . . and is analytic elsewhere. It can be shown that 



(4 ' 46) 



Now the right-hand side of (4.46) never vanishes. Hence, if a zo exists 
such that r(z ) = 0, then, of necessity, F(l ZQ) could not be finite so 
that 1 ZQ is a pole of F(z). Hence 1 z = ft and zo = 1 + n, 
F(ZO) = T(l + n) = n! ?^ 0, a contradiction. Thus T(z) has no zeros. 
We state without proof Legendre's duplication formula, 

i) (4.47) 



170 ELEMENTS OP PUKE AND APPLIED MATHEMATICS 

An important result is Stirling's approximation to n\ torn large. It is 



n\ V2irn(~ 1 (4.48) 

This result will be obtained in Sec. 10.25 by use of the Euler-Maclaurin 
sum formula. 

The beta function defined by 

B(p, q) = fc V>~ l (l - O'- 1 dt (4.49) 

is closely related to the gamma function. The convergence of (4.49) 
exists if Rl p > and Rl q > 0. We choose 



One can show with some labor that 



The substitution u = (1 - OA or Z = 1/(1 + w) in (4.49) yields 

v / u p ~ l du 

- 9) - Jo (T 



Thus r(*)r(l - z) = I ~- du < Rlz < 1 

Jo I + u 

The substitution u = e' yields 



/- ^ 

To evaluate / t dt one applies the theorem of residues to the 

J -* "i ^ 

closed rectangle C with vertices at / = #, / = S, t = S + 27ri, 



ft and S real and positive, / the complex variable of integration. The 
pole of e"/(l + e') inside this contour exists at t = in. The residue of 
+ eO at < = id is 



By letting R and S tend to infinity and using the fact that < Rl z < 1 
one can obtain T(z)T(l - z) = T/sin ** [see (4.46)]. By analytic con- 
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tinuation it follows that (4.46) holds throughout the domain of definition 
of I/sin irz. 

Problems 

1. Show that r(i) - VT- 

2. Prove that (2n)\ = 2*n\T(n + )*-*. 

3. From (4.45) show that lim (z -f n)V(z) =* ( l)"/n!, n a positive integer. 

jr * n 

4. Show that 



/T 



/2 . 

sin 2 "" 1 cos 2 " l 0dO = ?R(p, q] 



for Rl p > 0, Rl ? > 0. 

5. Integrate t*~ l e~ l around the complete boundary of a quadrant of a circle indented 
at the origin, and show that 



for < Rl z < 1. 
6. Show that 
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CHAPTER 5 

DIFFERENTIAL EQUATIONS 



6.1, General Remarks. A differential equation is any equation involv- 
ing derivatives of a dependent variable with respect to one or more 
independent variables. Thus 



+ 



+ x = & (5.4) 

are classified as differential equations. Equation (5.3) is called a partial 
differential equation for obvious reasons. The others are called ordinary 
differential equations. One may have a system of differential equations 
involving more than one dependent variable [see (5.5)], 

*L * _ (5 ' 5) 

In addition to their own mathematical interest differential equations 
are particularly important since the scientist attempts to describe the 
behavior of certain aspects of the universe in terms of differential equa- 
tions. We list a few of this type. 



dx 1 

- + X = Acoso)t (5.7) 



(58) 

(i} -* } 



-0 (5-9) 

a = (5.10) 
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The order of the highest derivative occurring in a differential equation 
is called the order of the differential equation. 

5.2. Solution of a Differential Equation. Initial and Boundary Con- 
ditions. Let a differential equation be given involving the dependent 
variable <p and the independent variables x and y. Any function <p(x, y) 
which satisfies the differential equation is called a solution of the differ- 
ential equation. For example, it is easy to prove that <?(x, y) = e* sin y 



satisfies VV = -r + - = 0- We say that e* sin y is a solution of 
ax 1 dy* 

VV = 0. It is important to realize that a differential equation has, in 
general, infinitely many solutions. For example, y" = admits any 
function y = ax + b as a solution, where a and b are constants which 
can be chosen arbitrarily. To specify a particular solution, either initial 
conditions like 

y = 2, T/' = 1 when x = 3 

or boundary conditions like 

y = 2 when x 3 and y = 4 when x = 1 

must be given in addition to the differential equation. In the first case 
y = x \ is the solution, and in the second case y = ?x + is 
the solution. A full study of a differential equation implies the deter- 
mination of the most general solution of the equation (involving arbitrary 
elements which may or may not be constants) and a discussion of how 
many additional conditions must be imposed in order to fix uniquely the 
arbitrary elements entering in the general solution. Without attempting 
an exact statement or proof, at the moment, we state the fact that "in 
general' ' the most general solution of an ordinary differential equation 
of order n contains exactly n arbitrary constants to be uniquely deter- 
mined by n initial conditions. 

Problems 

1. Verify that the following functions arc solutions of the corresponding differential 
equations: 

(a) y = z 2 - e* + 1 y'" + xy" - y + | * - i *(! - e*) 

... 9 d z u . du du 0/ , ,, 

(6) u-x*-y w+*Txdj - 2 ( 1 -* 1 ) 

(c) y + x + 1 = (y - x}y' - (y* - x 2 ) = 

2. Verify that y = ae x 4- be** is a solution of the differential equation 

y" - 3i/' + 2y - 

Determine the particular solution which satisfies the initial conditions y ** 0, y' 3 
for x = 0. 

3. Find the solution of y' + 3xe* which satisfies the initial condition y 1 
when x * 1. 
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5.3. The Differential Equation of a Family of Curves. The family of 
straight lines in the plane is characterized by the equation y = ax + b, 
where a and 6 are arbitrary constants. To fix these constants means to 
select one member of the family, that is, to fix our attention on one 
straight line among all the others. The differential equation y" = OJs 
called the differential equation of this family of straight lines because 
every function of the form y = ax + b satisfies this equation and, con- 
versely, every solution of y" = is a member of the family y = ax + b. 
The differential equation y" = characterizes the family as a whole 
without specific reference to the particular members. 

More generally, a family of curves can be described by 

y = /(x, ai, a 2 , . . . , a) 

or implicitly by F(x, y, ai, a 2 , . . . , a n ) = 0, in which n arbitrary con- 
stants appear. The differential equation of the family is obtained by 
successively differentiating n times and eliminating the constants 
between the resulting n + I relations. The differential equation that 
results is of order n. 

Example 5.1. To find the differential equation of the family of parabolas 

y ~ ax + bx* 
we differentiate twice to obtain 

y' - a + 2bx 
y" - 26 

The last equation is solved for b, and the result is substituted into the previous equa- 
tion. This equation is solved for a, and the expressions for a and b are substituted into 
y ax + bx*. The result is the differential equation 

y - xy' - \x*y" 

The elimination of the constants a and b can also be obtained by considering the 
equations 

xa + x*b + (-r/)l - 

tt + 2x6 + (-y')l - 

2b 4- (-!/")! - 

as a system of homogeneous linear equations m a, 6, 1. The solution (a, 6, 1) is non- 
trivial, and hence the determinant of the coefficients vanishes. 



x x z y 
I 2x -y' 
2 -y" 



- 



Expansion about the third column yields the result above. 

Problems 

Find the differential equations whose solutions are the families 

1. y - Cie* + C*r* 

2. y Ci cos 2x -f C* sin 2x 
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3. y - *(Ci + C 2 z) 

4. Find the differential equation of all circles which have their centers on the y axis. 

5. Find the differential equation of all circles in the plane. 

6. Find the differential equation of all parabolas whose principal axes are parallel 
to the x axis. 

7. Find the differential equation of all straight lines whose intercepts total 1. 

6.4. Ordinary Differential Equations of the First Order and First 
Degree. Let y be the dependent variable, and let x be the independent 
variable. The most general equation of the first order is any equation 
involving x, y, and y'. If y' enters in the equation only linearly, that is, 
only in the first power, the equation is said to be of the first degree. Such 
an equation can be written in the form 



We shall see later that under suitable restrictions on f(x, y) there always 
exists a unique solution y = <p(x), such that 2/0 = ^(#o) and 



The discussion of (5.11) will be restricted to the simplest cases at 
present : 

(a) f(x, y) = a(x)0(y). 

(fr) f(x, y) is homogeneous of degree zero, that is, f(tx y ty) = t n f(x, y), 
n = 0. 

(c) Exact equations. 

(d) Integrating factors. 

(e) f(x, y) = -p(x)y + q(x). 

Case (a): Separation of Variables. If 

dy _ M(x) . . 



then M(x) dx + N(y) dy = 0. Consider 

F(x 9 y) = |* M(x) dx + f v N(y) dy = constant (5.13) 



The integrals in (5.13) are indefinite (no lower limit). We wish to show 
that y as a function of x given implicitly by (5.13) satisfies (5.12). We 
have 

^ + ^^ = dy = - dF/dX 5*0 
dx dy dx dx dF /dy dy 

But I? = *& = "<*> 8 that %--' Q - RD - Con " 
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versely, let y = y(x) be any solution of (5.12), so that 

dy(x) _ _M(x) 



dx 
Consider F(x, y(x)} = f* M(x) dx + f y(x) N(y) dy. We have 




from (5.14) so that F(x, y(x}} constant. Hence any solution of (5.12) 
can be obtained from (5.13) by solving implicitly for y in terms of x. 

Example 5.2. We solve y dx -f (1 -f x 2 ) tan" 1 x dy = 0. Separating the vari- 
ables, we have 

dx , dy _ ~ 

(1 + x*) tan- 1 x y ~ 

Integration yields In tan" 1 x + In y = C\ = In C 2 so that y tan -1 x - C 2 or 

y = Cjftan- 1 x)' 1 
Problems 

dy =_0 

2 VxeW dy = 
y(x* + 1) d# - 



6. For what curves is the portion of the tangent between the axes bisected at the 
point of contact? 

6. Find the general equation of all curves for which the tangent makes a constant 
angle ^> with the radius vector. 

7. Find the function which is equal to zero when x 1, and to 1 when x = 4, and 
whose rate of change is inversely proportional to its value. 

8. Find r(x) if r - v^ 7 ^. 

ax 

9. Find the family of curves intersecting the family of parabolas y* ** 4px at right 
angles for all p. These curves are called the orthogonal trajectories of the system of 
parabolas. 

10. The area bounded by the x axis, the arc of a curve, a fixed ordinate, and a 
variable ordinate is proportional to the arc between these ordinates. Find the equa- 
tion of the curve. 

11. Assume that a drop (sphere) of liquid evaporates at a rate proportional to its 
area of surface. Find the radius of the drop as a function of the time. 

12. Brine containing 2 Ib of salt per gallon runs at the rate of 3 gpm into a 10-gal 
tank initially filled with fresh water. The mixture is stirred uniformly and flows out 
at the same rate. Find the amount of the salt in the tank at the end of 1 hr. 

13. It begins to snow some time before noon and continues to snow at a constant 
rate throughout the day. At noon a machine begins to shovel at a constant rate. 
By 1400 two blocks of snow have been cleared, and by 1600 one more block of snow is 
cleared. What time before noon did it begin to snow? Assume width of street a 
constant. 
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Case (6). We say that /(z, y) is homogeneous in x and y of degree n 
if f(tx, ty) = t n f(x, y). For example, f(x, y) = x 2 + y 2 is homogeneous 
of degree 2 since /(to, ty) = tV + t*y 2 = / 2 (z 2 + i/ 2 ) = t z f(x, y). We 
now consider 

Tx = /( *> y) (5 ' 15) 

where /(to, ty) = < n /(x, y). We let T/ = to, so that -p = t + x-j 

&X uX 

Moreover /(x, to) = x n /(l, <) so that (5.15) becomes 

t + X Tx = a ^ (1 '' ) ( 5 - 16 ) 

If n = 0, that is, if /(x, y) is homogeneous of degree zero, then 
dt dx 



/(I, t) -t x 



/(I, - < * (5.17) 



and the variables have been separated. We can solve for t = t(x), and 
y xt(x) satisfies (5.15). In particular, if 

dy = M(x, y) 
dx N(x, y) 

and M(x, ?/), N(x, y) are homogeneous of the same degree, then M/N is 
homogeneous of degree zero, so that the substitution y = to yields an 
equation in t and x with variables separable. 

Example 5.3. Consider 



. 

dx x - y 
x> y) = (x 4- 2/)/(z T/) is homogeneous of degree zero. Let y te, so that 

, , d _ x + fcr _ 1 -f- / 



tan~ l t - ? In (1 + t 2 ) In x -f In C 

tan- 1 - - \ In ( 1 + ^ ) In Cx 
x \ x / 



Problems 



^ dy = yi_ 

6tX X ~~" ^ 



2 ^ y 4- x* + 



d!x x 

rfy _ x + y 
S ' dx ~ x 
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4. ^ x + y + * Hint: Let x = * -f a, y = y + 6, and find a and 6 so that 
ax x y T 



5. Discuss (5.15) if /(I, = t in (5.17). 

Case (c): Exact Equations. We consider 



_ 

d* tf (*, y) 

or M(x, y) dx + N(x, y) dy = (5.18) 

We say that Eq. (5.18) is exact if there exists a function <p(x, y) such that 
d<p(x, y) = M(x, y) dx + N(x, y) dy (5.19) 

If (5.18) is exact, then d<p = and <p(x, y) constant is a solution of 
(5.18). From the calculus 



so that for (5.18) to be exact we must have 

|f = M(x, y) 

(5.21) 



Further differentiation yields 

ay ay 



dy dx dy dx dy dx 
If we assume continuity of the second mixed partials, of necessity 



dx 



(5.22) 



Equation (5.23) is a necessary condition that (5.18) be exact. 

Conversely, assume = -T We sh 
dy ox 

such that d<p = M dx + N dy. Consider 



Conversely, assume = -T We show that a function p(x, y) exists 
dy ox 



(5.24) 
o and t/o are arbitrary constants** Then 
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^ = M(x, y) 

-^ dx + N(XQ, y) 
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** = r 

dy Jxo 

-r. 



, y) + N(x, y) 



= N(x, y) - 
= N(x, ij) 

Hence (5.24) yields the required function v?(j, y). 

If this seems familiar, there is a good reason, since the same material 
was covered in Chap. 2, Vector Analysis. Let f = M(x, y)i + N(x, y)j. 
Then f dr = M dx + N dy. If f = V<p, then dtp = M dx + N dy. But 
f = V<p implies V X V<p = 0, which is statement (5.23). 



Example 5.4. Consider 



(In x +\ny + })dx + -dy =0 



(5.25) 



1 , -, dM I , T X dN ] dM . A ._ __ N . 

In y + 1, = - TV = -> -r = - = -r > so that (5.25) is exact 
dy y y dx y dy 



M = In x 

Applying (5.24) yields 

<f>(?, y) = I (In x +ln y + I) dx + I - dy = constant 

Integration yields 

y.y 



<?(j, y) * x In 
a: In a: a; 



In 



|a? 1 



+ 



l+xiny \ny-\-x \-}-\ny**c 
x In (xy) = c 



We chose the lower limits to be 1 rather than zero since In x is not defined at x 
The reader can check easily that 



0. 



d[x In (xy)} 



(In a- -f In y + 1) d* + - 



Problems 

1. (xe* -f 2 cos i/) dx 2x sin y dy 



2. 



= o 



\ o **' / 

z 4- 2/) 2 (x + ; 

3. sin y e x wn v dx -f (ze x Bin w cos y 2 cos y sin y) dy - Q 

Case (d): Integrating Factors. It may readily turn out that (5.18) is 

not exact, that is, -r ^ Let us assume that this is so. Perhaps 
dy ox 

one can find a function n(x, y) such that 

dx + uNJy = (5.26) 
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is now exact. We call p(x, y) an integrating factor of (5.18). The solu- 

flii ii/f 

tion of (5.26) is also a solution of (5.18) since in both cases -~ = *& 

ax N 

In general it is very difficult to find an integrating factor. There are two 
cases, however, for which an integrating factor can be found. Let us 
determine the condition on M and N such that \L = n(y) is an integrating 
factor. Of necessity 



, dp ,, , dM dN 

and _ M + __ = __ 



so that - 5= = t " T/ " ' " (5.27) 

nay M 

If Tf is a function of y only, then (5.27) can be solved for ?/, 

for in this case 

<M _dM 
dx dy 



f 

M(?/) X P / 



- 



, 
dy 



Example 5.5. Consider 

-ydx + xdy = (5.28) 

We have = 1, - 1 - = 1, so that (5.28) is not exact. However, 

dN/dx - dM/dy __ __ 2 
M ~ y 

so that an integrating factor exists, namely, 



Multiplying (5.28) by 1/V yields 

- - dx -f - 2 d^ =0 (5.29) 

The reader can easily verify that (5.29) is exact. The solution of (5.29) is x/y con- 
stant, which is also a solution of (5.28). 

For an integrating factor of the type n(x), one needs of necessity 

dM/dy - dN/dx 

- - ss <p(jc) (5.30) 



Problems 



1. 2xy dx + (2/2 - 3x 2 ) <fy - 

2. (x - xy*) dx -f x*y dy 
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3. Prove the statement concerning (5.30). 



Case (e): Linear Equations of the First Order. The equation 

j| + P(x)y = q(x) (5.31) 

is called a linear differential equation of the first order. The expression 
"linear" refers to the fact that both y and y f occur linearly in (5.31). 
The functions p(x) and q(x) need not be linear in x. If q(x) = 0, (5.31) 
is said to be homogeneous; otherwise it is inhomogeneous. 
The solution of the homogeneous equation 

j| + p(x)y = 

is trivial since the equation is separable. Hence 

y = A exp [ /p(x) dx] (5.32) 

where A is a constant of integration, fp(x) dx is an indefinite integral. 
Now we attempt to use (5.32) in order to find a solution of (5.31). We 
introduce a new variable z(x) by the equation 

y = z exp [-fp(x) dx] 
or z = y exp Jp(x) dx 



-J? 



where y(x) is a solution of (5.31). Then 

Jx = df 6Xp j P ^ dx + yp exp J p( 
But ^ = q(x) - p(x)y from (5.31) so that 

p2/) exp / pdx + py exp / 
exp / p(x) dx 

Hence z(x) = / q(x) (exp / p(x) dx] dx + C 

so that 

y(x) = exp [ - J p(x) dx] \J q(x) (exp J p(x) dxjdx + C J (5.33) 
The reader can easily show that (5.33) is a solution of (5.31). 
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Example 6.6. Consider 

dy 

-f- + y cot x x l csc x 

ax 

Here p(j) cot x, ?(x) * j* cscz so that (5.33) becomes 

y(x) = exp f / cot j dxj / x* esc x (exp / cot x dx J dx -f C 

o fe lnn- fa 4. 



/ f 



Example 5.7 We now investigate (5 IW) in more detail. We have 

l/(x) - C exp |-/p(jr) dj] -f exp [-Jp(x) rfx][J(/(x) (exp Jp(x) dx) dx] 
- i/i(*) -f 



We notice that j/i(x) C exp ( ~/p(x) dxj is the general solution of the homogeneous 
equation ~ -f- p(x)y 0. Moreover j/a(x) is a particular solution of (5.31) obtained 
from (5.33) by choosing C In other words we have Theorem 5.1. 

THEOREM 5.1. The most general solution of the inhomogeneous equa- 
tion is the sum of a particular solution plus the general solution of the 
corresponding homogeneous equation. 

This important result is valid also for linear equations of higher order 
and is discussed later. In simple cases a particular solution can be found 
by inspection. For example, consider 

te + y-* ( 5 - 34 ) 

We look for a particular solution of the form y = ax + b. Then 

o + aj + 6 = x 

holds if a + 6 0, a =. 1 so that 6 = 1 . Hence y = x 1 is a 
particular solution of (5.34). This method of obtaining a particular 
solution is called the method of undetermined coefficients. The general 

solution of the corresponding homogeneous equation, ~ + y = 0, is 

y *= ce~~*, so that 

y * <#- -f x 1 

is the general solution of (5.34). 

Problems 
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8. x -r y * c 08 x 
ox 

4. Change + p(x)y g(x)y" to a linear equation by the substitution z y 1 "", 
n P* 1. What if n - 1? 

Review Problems 



2. A body falls because of gravity. There is a retarding force of 1 riot ion propor- 
tional to the speed of the body, say, kv. If the body starts from rest, find the distance 
fallen as a function of the time. 

8. (x + y) dx + x* dy - 

4. [sin (xy) + xj/ cos (xy)] dx -f- x 1 cos (xy) dy 

5. t-**(2xy - 2*V) cte 4- <T*'*(3x - x*y) dy - 

6. Brine containing 2 Ib of salt per gallon runs at the rate of 3 gpm into a 10-gal 
tank initially filled and containing 15 Ib of salt. The mixture is stirred uniformly and 
flows out at the same rate into another 10-gal tank initially filled with pure water. 
This mixture is also uniformly stirred and is emptied at the rate of 3 gpm Find the 
amount of brine in the second tank at any time (. 

6.6. An Existence Theorem. We consider the equation 

5- - f(x, y) (5.35) 

We wish to show that if /(x, y) is suitably restricted in a neighborhood 
of the point PO(ZO, 2/o) there exists a unique function y = y(x) which 
satisfies (5.35) such that y = t/(xo). The restriction we impose on 
/(x, y) is the following: Assume that a constant M ^ QO exists such that 
for all points P(x, y), Q(x, z) in some neighborhood of P O (XO, yo) we have 

!/(*,) -f(x,y)\ <M\z-y\ (5.36) 

A function /(x, y) satisfying (5.36) is said to obey the Lipschitz condition. 

An immediate consequence of (5.36) is the following: If ^ exists in a 

dy 

neighborhood of PO(X O , 2/0), then ~ is uniformly bounded in that neigh- 
borhood, for [from (5.36)) 



= lim 



;/(.) -f(x, y) 

I 2 - v 



<M 



Conversely, if ^ is continuous for a closed neighborhood of /V#o, yo), 

then /(x, y) satisfies the Lipschitz criterion for that neighborhood. 
Remember that a continuous function on a closed and bounded set is 
uniformly bounded over the set. We assume further that /(x, y) is 
continuous so that the integrals of (5.37) exist. 
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The proof of the existence of a solution of (5.35) is based upon Picard's 
method of successive approximations. Define the sequence of functions 
yi(x), y*(x), . . . , i/nGr), ... as follows: 



/ * f( x , 2/o) dx + 2/0 

Jxo 

2/200 = /,*/(*> yi(*}) dx + 2/0 

y z (x) = [*f(x, 2 (j)) dx + </o 

Jxo 

................. (5.37) 

n -i(x) = / */(r, 2/n-2(x)) dx + ?/ 

Jxo 

J/nCr) = / * f(x, ?/n-iOr)) ^ + 2/0 

Ja-o 



The y/,(a-), i = 1, 2, . . . , n, . . . , are obtained in a very natural 
manner. In /(j, ?/) we replaco y by the initial constant ?/ , integrate, and 
obtain t/i(j) such that ?/i(x ) = z/o. We now replace y inf(x, y) by */iU), 
integrate, and obtain i/z(x). This process is continued indefinitely. The 
next endeavor is to show that the sequence thus obtained converges to a 
function t/(x) which we hope is the solution of (5.35). Let us note that 

y n (x) S //o + (?/, - |/ ) + (2/2 - y\) + ' ' + (//n - l/n-l) 

Hence the investigation of the convergence of the nequence hinges on the 
convergence of the series 

2/o + (2/1 - 2/o) + (2/2 - yi) + ' ' + (2/ - //-i) + (5.38) 

Let A' be an upper bound of f(x, y} in the neighborhood of PoCr , 2/o) 
for which the Lipschitz condition holds. Then from (5.37) 



A> - x | (5.39) 

! / 

Also from (5.37) 

\y t (x) - y t (r)\ = [f(x, ,) - /(ar, y.)] rfx 



from the Lipschitz condition. Applying (5.39) yields 

|i(i) - J/I(JT)| < -WA' j f* |* - ioi rfx 
y 



< J " tK TT 
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Let the reader show that 



and by mathematical induction that 

1 ~ (5.40) 



Hence each term of the series (5.38) is hounded in absolute value by the 
terms of the converging series 



Since the series for <> M]T rof converge?; uniformly in any closed and 
bounded set, the series (5.38) converges uniformly for those x in the region 
for which the Lipschitz condition holds. We have 

lim y n (x) = //(x) 

n 

and the convergence is uniform. From 

y*(x) = / T /(J", 2/-iGr)) dx + y<> 

Jxo 

we have 

y(x) = lim y n (x) = lim J r *f(Xi //n-i(:r)) dx + y 

= jj lim /(x, 2/_i(x)) c/x + i/o 

because of uniform convergence. The Lipschitz condition (5.36) also 
guarantees continuity of /(x, y) with respect to y, for 

lim |/(x, 2) /(x, y)\ = 
since 3/|s y\ > as z * y. Hence 

f(x, i/(x)) dx + i/o 
and ~ = /(x, y(x)) y(x ) * 2/0 



so that y(x) satisfies (5.35) and the initial condition. Q.E.D. 

We must now show that y(x) is unique. Let z(x) be a solution of (5.35) 
such that Z(XQ) y*. Then 

~ /(x, ^(x)) 
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and integration yields 



( x > *(*)) dx + y* 

Hence z(x) - y(x) = f * [/(*, 2(2:)) - /(s, t/(*))] 



(5.41) 

\z(jc) -y(x)\ <M \z(x) -y(z)\dx 



by applying the Lipschitz condition. Let L he an upper bound of 

\z(x) y(x)\. Then 

\:(x) - y(x)\ < ML\x - xo| 

and, applying (o 41) successively, we obtain 
\z(x) - y(x)\ 



|z(ar) - y(x)\ < M"L L 



V 

/ 

/^ 

converge, we have 



a . B j - J n . 4l 4l 4 r "" <> u- u i 

Since , is the nth term of / -, which is known to 

;? ! 



, - , = 

-* w! 

for any fix( k d .r. Hence the difference between z(x) and y(x) can be made 
as small an w< please. f l"his means that z(x) s= y(x). Q.E.D. 

Problems 

1. Show that /(x, !/) x?/ satisfies the Lipschitz condition for |x| < A < . 

2. Show that (5 40) holds. 

8. Show that /(J, //) J" sin ?/ does not satisfy the Lipsdutz condition for all x. 

4. Consider ~ j/, y(0) * 1. Obtain the sequence (5.37), and show that 

hm j/nU) - e* 
-* 

6. CoiiHider ^ - x -f t/ f , j/(0) = Obtain y t (x), y t (x), y s (x), j/*(x) from (5.37). 
6. (Consider the system 

g-.ru.,,*> 



Impose suitable restrictions on /(x, y, 1} and ^(j, y, f ) in a neighborhood of the point 
^(ir, Vo, *), and obtain a sequence of functions j/i(x), Vi(x), . . . , y.(x), . . . , and 
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a sequence of functions Zi(x), z*(x), . . . , *(), . . . , such that lira y(x) j/(x), 

n 

lim z n (x) - z(x) t where t/(x) and z(x) satisfy (5.42). Show that y(x) and z(x) are 

n 

unique if y(x ) yo, z(x ) ~ 2 o- 

5.6. Linear Dependence. The Wronskian. A system of functions 
y\(x), 2/2(2;), . . . , 2/n(#), o ^ ^ ^ 6, are said to be linearly dependent 
over the interval (a, 6) if there exists a set of constants r jt r 2 , . . . , c n , 
not all zero, such that 

n 

c^(x) m (5.43) 

for all x on (a, 6). Otherwise we say that the y(x) are linearly inde- 
pendent. Equation (5.43) implies that at least one of the functions can 
be written as a linear combination of the others. Thus, if c\ ^ 0, then 



Linear dependence will be important in the study of linear differential 
equations. Let us find a criterion for linear dependence. Assume the 
y,(x) of (5.43) differentiate n I times. Then 

y(x) - j 0, 1, 2, . . . , n - 1 (5.44) 

by successive differentiations. The system (5.44) may be looked upon as 
a system of n linear homogeneous equations in the unknowns r,, i = 1,2, 
. . . , n. Since a nontrivial solution exists (remember not all c vanish), 
we must have 

\y?(*)\ = o 
or 



W(y t ,y,, 



/,(*) 



(5.45) 



Determinant (5.45) is a necessary condition that the ?y,(j) be linearly 
dependent on a <J x ^ 6. This important determinant is called the 
Wronskian of y it y^ . . . , y n . If W(yi, ?/ 2 , . . . , //) ^ 0, the # t are 
linearly independent. 
Let us now investigate the converse. We take first an easy cae. Let 
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for a g x b. Assume yi(x) ^ for a < x g b. 
would be obtained if we assumed y*(x) -^ for a g x 



The same result 
6. Then 



2/12/2 - 2/22/1 

y\ 



o 



and i~ ( ~ ) = so that 2/ 2 = cy\. which shows that y\ and 
dx \2/i/ 



are linearly 



dependent. 

The proof for the general case is not so easy. We assume that (5.45) 
holds for a ^ x g 6. Furthermore we assume continuity of the deriva- 
tives, along with the assumption that at least one of the minors of the last 
row of (5.45) does not vanish for a ^ x ^ 6. For convenience we assume 



0(2) 



1/2(3) 

2/2(2) 



2/3(2) 



2/n(2) 



.n 
2/3 



for a ^ x g 6. We now show that under these conditions the y l are 
linearly dependent over the range a ^ x ^ 6. First expand (5.45) about 
the first column to obtain 



= 



Hence 



is a solution of the linear differential equation 



*~ l y , pi 
n ~ l 



dx 



, 
n ~ l 



+ -r 1 y = 



(5.46) 



Moreover, if we replace y\(x) by y^x), y[(x) by 7/J(x), etc., in the first 
column of (5.46), the determinant vanishes since two columns are the 
same. Hence (5.46) is also satisfied by y*(x). With the same reasoning 
we find that y\(x), 2/2(2), 2/3(2), . . . , y n (x) are solutions of (5.46). The 
reader can verify easily that any linear combination of y^ t/ 2 , . . . , y n 
is also a solution of (5.46). Now we fix our attention at a point 2 , 
a g XQ g b. We have 



2/1(^0) 2/2(20) 



= 



so that the system of homogeneous equations 

n 

/^ Ct^'(#o) =0 j = 0, 1, 2, . . . , n 1 



(5.47) 



f-1 



has a nontrivial solution in the c,-. Consider a set of c not all zero which 
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satisfies (5.47), and form 

n 

y(x) = ^ c *^ x ) ( 5 - 48 ) 

Equation (5.48) is a solution of (5.46). Moreover y(x Q ) = 0, y'(x ) = 0, 
. . . , 2/ (n ~ 2) (#o) = as well as y (n " 1) (xo) = 0. It will be shown in 
Sec. 5.7 that Eq. (5.46) has a unique solution when n 1 initial condi- 
tions are imposed. Now y(x) = certainly satisfies (5.46) and the 
initial conditions y(xo) = ?/'(o) = y"(xo) = = i/ (n ~ 2) (x ) = 0, so 
that y(x) of (5.48) is identically zero for a ^ x ^ b. Hence 

n 

J c# t (x) m 

for a g x ^ b. Q.E.D. 

Problems 

1. Show that sin x and cos x are linearly independent. 

2. Show that sin x, cos x, e are linearly dependent. Assuming e**/ 2 i, show 
that e tx a* cos x + * sin x. 

3. Consider yi = x 2 , ^ 2 - x|x|, -1 ^ x ^ 1. Show that Tf (y lf y 2 ) - for 
1 ^ x ^ 1. Does this imply that y\ and ?/ 2 are linearly dependent? Show that 
7/1 and yz are not linearly dependent on the range 1 ^ x ^ 1. Does this contradict 
the theorem derived above? 

4. Let i/i (x) and y 2 (x) be solutions of 

y" + p(x)y' -f q(x)y = 

for a ^ x ^ b. Show that -^ 1? + p(x)TT(?/i, |/ 2 ) = and hence that 
W(y\, y>t) A exp [ / p(x) dx] 



Show that if W for a: = x then TF = for all x on a ^ x ^ b. How does one 
determine the constant A ? Show that if W 7* for x = #0 then W ^ for all x on 
a g x ^ 6. 

5. Let 2/1 (x), yz(x) be linearly independent solutions of 

y" + P(x)y - (5.49) 

for a ^ x ^ 6. Let y z (x) be any solution of (5.49). Show that 

-f 



(: Show that W(yi, y* y t ) = for a ^ x ^ 6. 
6. Let yi(x), 2/ 2 (x), . . . , y(x) be linearly independent solutions (on the range 
a g x ^ 6) of 

(5.50) 
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If y(x) is any solution of (5.50), show that 



CkVk(x) 

k^l 
for a ^ x b. 

6.7. Linear Differential Equations. The differential equation 



is called an nth-order linear differential equation. What can we say 
about a solution of (5.51) subject to the initial conditions i/(xo) = 2/0, 
tf'(zo) = 2/J, y"(zo) = 2/'o', . . . , 2/ (n - !) (zo) = 2/ ( n - 1) ? Assume y^x) a 
solution of (5.51). We note that (5.51) can be written as a system of n 
first-order equations by the simple device of introducing n 1 new 
variables. Define y z , y*, . . . , y n as follows: 



(5.52) 



dx 



The last equation of the system (5.52) is (5.51) in terms of 2/1, 2/2, . . . , 
y n . Conversely, if [i/i(x), y*(x), . . . , y n (x)] is a solution of (5.52), then 
y\(x) is easily seen to be a solution of (5.51). Now (5.52) is a special case 
of the very general system 1 

3J = P(x, y l , y*, > 2/ n ) i = 1, 2, . . . , n (5.53) 

As in Sec. 5.5 it can be shown that, if the/* satisfy the suitable Lipschitz 
conditions, there exists a unique solution y 1 (x) 1 y*(x), . . . , y*(x) of 
(5.53) satisfying the initial conditions y*(x ) = yl, i = 1, 2, . . . , w. 
The Lipschitz condition for the /* is 

n 

!/*(, 2/S 2/ 2 > > 2/ n ) "" /^(^, 2 1 , ^ 2 , . ^ . , z n )| < Af V |i/* 2*| (5.54) 

t-i 

for all x in a neighborhood of x = Xo, Af fixed. 
1 The exponents are superscripts, not powers. 
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If p(x), Pi(x), f = 1, 2, . . . , n of (5.52) are continuous in a neighbor- 
hood of x = XQ, then (5.54) is easily seen to hold. Hence (5.51) has a 
unique solution subject to the n initial conditions stated above. 

The reader can show easily that if 2/1 Or), yz(x), . . . , y n (x) are linearly 
independent solutions of the homogeneous equation 



and if y(x) is any particular solution of (5.51), then 

n 

y(x) = <W,(*) + (*) (5-56) 



is the most general solution of (5.51). Let the reader show that the 
d are uniquely determined from the initial conditions on y and its first 
n 1 derivatives at x XQ. 

In general it is very difficult to find the solution to (5.51), and it is 
necessary at times to use infinite series in the attempt. This method 
will be discussed in a later paragraph. There is one case, however, 
which is the simplest by far. Naturally, one expects the difficulties in 
solving (5.51) to be alleviated to some extent if the p t (x) of (5.51) are 
constants. We study now this case. The homogeneous equation 



p t = constant: i = 1, 2, . . . , n (5.57) 

can be solved as follows: Assume a solution of the form y = e mx . Sub- 
stituting into (5.57) yields 

e mx (m n + piin*- 1 + p 2 m n -* + - + p n ) = 
Hence if m is a root of the polynomial equation 

/(*) Z + Pl*"- 1 + P22"- 2 + + Pn = (5.58) 

then y = e mx is a solution of (5.57). Equation (5.58) is easily obtained 

d k v 
from (5.57). One replaces -p[ by z k , k = 0, 1, 2, . . . , n. If the n 

roots of (5.58) are distinct, call them mi, m 2 , . . . , m n , then the general 
solution of (5.57) is 



y T 



(5.59) 



The reader can show that if the w* are distinct, then 

^ 
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First let the reader show that 

1 1 1 

t\ ra 2 m n 



W = exp 



1 x } mk\ 
\ *i / 



ml 



(5.60) 



Next let the reader notice that 



F(u) - 



is a polynomial of degree n 1 in u. The polynomial equation F(u) = 
certainly vanishes for the n 1 distinct roots u = ra 2 , ra 3 , . . . , m n . 
If F(mi) were zero, then 



1 


1 


1 


... i 


u 


ra 2 


m 9 


' ' m n 


u 2 


ml 


ml 


- - - ml 


y n-l 


ml~ l 


msr 1 


' ' <-* 



1 


1 


1 


1 


ra 2 


ra 8 


ra 4 


ra n 


ml 


ml 


ml 


ml 


m?- 2 


raj" 2 


. 


w~ 2 



= 



Why? 



Continuing with this line of reasoning, the reader can show that as a 
consequence of assuming W = one finally obtains 







a contradiction, since ra n 5^ ra n _i. 

The case for which some of the roots of (5.58) are equal must be treated 
separately. Before attacking this problem we shall find it beneficial to 
introduce the operator 

d (5.61) 



j- 
dx 






The notation sf(x) means 3- f(x) = f'(x). Similarly, 

ax 

*/(*) - s[sf(x)} = [/'(*)] = f"(x) 

and s" s ~. We define (s + a)f(x) = sf(x) + af(x) = f'(x) + af(x) 
for any scalar a. Let the reader show that 



(5.62) 
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if a and b are constants. Equation (5.57) can be written 

(s* + pis"- 1 + p 2 s n ~ 2 + + Pn-is + p n )y = (5.63) 
or (s m-i)(s m 2 ) ( m n )y = (5.64) 

using the result of (5.62). If a is constant, we have 

s(e ax y) = e ax sy + ae ax y = e ax (s + a)y 
Let the reader show by mathematical induction that 

s n (e ax y) = e x ($ + a) n y (5.65) 

Now let us assume that (5.58) has the distinct roots mi, w 2 , m 3 , . . . , 
w* of multiplicity i, a 2 , . . . , a* so that (5.64) can be written 

/()y = (s - rai)* 1 ^ - m 2 )' (*- m k )**y = (5.66) 

We note that, if y(x) satisfies (s nik) ak y(x) = 0, then 2/(#) also satisfies 
(5.66). Now 

e -mk*( s - m k ) ak y(x) = s ak [e- mkx y(x)} 

from (5.65), so that any y(x) which satisfies (s m k ) ak y(x) = also 
satisfies s ait (e~- mkx y} = 0, and conversely. Hence 



so that y(x) = e m "*(d + C*x + + C^-iz"- 1 ) (5.67) 

It is easy to verify that (5.67) is a solution of (5.66). If a k = 1, then 
y(x) = Cie mkx . Equation (5.67) contains a k constants of integration. 
Applying the same reasoning to the remaining roots yields n constants of 
integration. We omit the proof that the solutions thus obtained, 
namely, 



are linearly independent. 

Example 5.8. The differential equation y" y = admits the solution y e m * 
if m 2 1 or m 1. The most general solution is 

y . Cie* -f C 2 e~* 

If Ci ^, C 2 = ~y, then t/ sinh x. If Ci C 2 ?, then j/ cosh x. It is easy 
to verify that sinh x and cosh x are linearly independent. The most general solution 
can also be written in the form 

y Ai sinh x f 4 2 cosh a: 

Example 5.9. We wish to solve y" -f n 8 i/ subject to the initial condition 
0, 2/'(0) - n, n real. c w * is a solution of y" + n*y - if m* -f n* - or 
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m in. Hence the solution is 

y Ae ni * + Be-*'* 

From (x = 0, y 0) we have A + B, and from (x * 0, ?/' n) we have 

n = (An Bn)i 

or A B * Hence yl ~ , = 5", and j/(x) ^ (*>"* e~ nt *) sin nx. 

Let the reader show that y ** A sin nx + B cos nx is also a general solution of 
?/" + n*y - 0. 

Example 5.10. Consider 

Jg+J*_ 6 S + 4*-0 (5.68) 

rfr 6 dx 4 <&r 8 dj* 

In operational form (5.68) becomes 

(s 6 + * 4 - 6* + 4* 2 )s/ - 

or ( - l) 2 (s* -f 2 + 4)y - 0. The zeros of /(m) - m*(ra - l) 2 (w 2 + 2m + 4) 
are m 0, 0, 1, 1, 1 \/3 * Hence the general solution of (5.68) is 

y . e (A -f Bs) + e-(C -f Dx) + E exp [(-1 + \/Zi)x] 

+ Fexp[(~l - -v/3^>] 
or y - A + Ex + (C + />*)* + c-(G cos V3 x + F sin V3 a 1 ) 

It has been explained before that the most general solution of an 
inhomogeneous equation can be written provided : 

(i) A particular solution is known. 

(ii) The general solution of the homogeneous equation is known. 

The second part has just been considered. It remains to study meth- 
ods by which a particular solution can be found. Here any guess can 
be made, and, if successful, no further justification is needed. Quite 
often a particular solution can be determined by inspection. The work 
involves finding certain undetermined coefficients. For example: 

(a) If p(x) of (5.57) is a polynomial, try a polynomial of the same 
degree. 

(6) If p(x) = ae rx , try Ae rx . 

(c) If p(x) = a sin coz + & cos (*>#, try A sin <ax + B cos <#x. 
A few examples should make clear what we mean. 

(a) y" + y "" 2x 2 . We assume a particular solution of the form y a -f- bx + ex*. 
Then y' 6 + 2cz, y" - 2c so that of necessity 2c -f a -f bx -f ex* == 2z* and 
c 2, 2c -4- a 0, 60. Hence t/ A sin x 4- 5 cos x -f 2x* 4 is the general 
solution of y" + y ** 2x*. 

(b) y" y ** x 2e**. Assume a particular solution of the form 

y a -h &z 4- cc s * 

Then j/' 6 -f- 2r^ >r , t/" - 4ce lz , and of necessity 4ee la a bx cc 1 * B x 2e f *. 
This yields 3c 2, a 0, 6 1. The general solution is 
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(c) y" + 4j/' 2y 2 sin x 3e* 4- 1 2x. We try a particular solution in the 
form y = a sin x + 6 cos x + ce* + d + /- Of necessity 

a sin x 6 cos x -f ce* -f 4a cos x 46 sin x + 4ce* + 4/ 2a sin x 

- 26 cos x - 2ce* - 2d - 2fx = 2 sin x - 3c* + 1 - 2x 

Equating coefficients of sin x, cos x, e*, etc., yields 3a 46 2, -36 -f 4a 0, 
3c - -3, 4f - 2d - 1, -2/ - -2, so that a - -^, 6 - -^, c - -1, d - |, 
/ 1. The reader can verify that y ^ sin x ^ cos x "~ e x + ? + x ls A 
particular solution of (c). 

Although in many cases a particular solution is easily obtained by 
the method of undetermined coefficients, it is important to have a formula 
valid in all cases. The problem is to find a particular solution of the 
equation 

g + * 3 + + "-> g + -* - PW (5-69) 

Let g(x) be a solution of the homogeneous equation 

+..*+ +-2 + -0 (5.70) 

which satisfies the initial conditions 

0(0) = 0'(0) = g"(0) = = f<-(0) = 0<-(0) - 1 
We prove that 

y(x) = fc g(x - t)p(t) dt (5.71) 

is a particular solution of (5.69). Differentiating (5.71) twice yields 
y'(x) = g(x - x)p(x) + j* * g( * ~ 



(x-t) 
since g'(0) = 0. Further differentiation yields 



/; 



dx 



i} P (t) dt 
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since (n ~ 1) (0) = 1- If these values are substituted into (5.69), one 
obtains 

yW(x) + aiy^ l) (x) + + a n y(x) 

= p(x} + *vo + ai ^po + . . . 



The expression 

(* - 



vanishes since g(x) satisfies (5.70). The derivatives of g(x) in (5.72) are 
evaluated at x t, but since g(x) satisfies (5.70) for all x we certainly 
know that g(x) satisfies (5.70) at x t. We have shown that (5.71) is a 
particular solution of (5.69); the a t , i = 1, 2, . . . , n, of (5.69) are 
constants. 

Example 5.11. Consider y" y sin x. We first solve y" y 0, which 
yields the solution g(x) = Ae x + Be~*. We now impose the initial conditions 
gr(0) 0, gr'(O) = 1. This yields A + B = 0, A - B = 1, so that A ^, B = -^. 
Thus gr(a:) ^-(e* e"*) = sinh x. A particular solution of y" y = sin x is 



\(^ x ^ - e-t*-) sin tdt 

P X r x p~ x f 

~ \ e~ l sm t dt - ~- / 
2 Jo 2 Jo 



sin 
o 



The complete solution is 

y = Ae* -f Be~* - ^ sin x -f \e x - \e~* 
= Aie 1 -f Bif" 7 - | sin x 

Example 5.12. it/'" 3j/" + ty' 2y = e x sec x. The solution of 

y'" - 3j/" + 4y' - 2y - 

is 

2/ =x ^(x) Ae x -f e*(B sin x + C cos x) 

If gr(0) 0, 0r'(0) - 0, ^"(0) - 1, we must have 

A -f C * 

A 4- C + B o 

A -f 2B - 1 

which yields B - 0, A - 1, C - -1. Thus 



** j [e*-* - e*-* cos (x - t)]<* sec t dt 

- e* I sec tdt - e* I* (cos x -f sin a; tan t) dt 

e* In (sec x + tan x) xc* cos x + c* sin x In cos a; 
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The general solution is 

y = e*[A + B sin x + C cos z -f In (sec x + tan z) x cos + sin x In oos #1 

Example 5.13. Method of Variation of Parameters. Equation (5.71) is a valid 
particular solution of (5.69) for constant coefficients. We look for a particular solu- 
tion of 



+ pi(x} + pt(x} + + p " (x)y = p(x) 

To simplify things, we shall discuss the general third-order linear differential equation 

S + Pl(x) S + P2(X) Tx + pi(x}y = P(X) (5 ' 73) 

Let 3/i(.c), 2/2(2*), 2/a(a;) be linearly independent solutions of the homogeneous equation 
derived from (5.73). 

3 

We know that y ^ A t y l is the most general solution of the homogeneous equa- 

i = i 

tion. The method of variation of parameters consists in attempting to find a par- 
ticular solution of the inhomogeneous equation by varying the A % , i = 1, 2, 3, that is, 
by assuming that the A,, i 1, 2, 3, are not constant. Assume 

y uiyi + uzyz + u z yz (5.74) 

We shall impose three conditions on -5-^ i = 1, 2, 3. Two of these conditions will be 



Differentiating (5.74) and making use of (5.75) yields 

3 

u t % (5.76) 



Differentiating again and making use of (5.75) yields 

3 
-7 : = > Ui -j^ 

dx z Lf dx* 

Finally we have 

3 

-~ = Y u l -T-* -f Y -j 1 -{ (5.78) 

We multiply (5.77) by pi(x\ (5.76) by p z (x), (5.74) by p*(x) and add these results to 
(5.78). If y(x) is to be a solution of (5.73), of necessity 
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Equations (5.75) and (5.79) are three linear equations in the unknowns ~i i = 1, 2, 3. 



Solving for -~ yields 



du\ 
~dx 



2/2 2/ 

1/2 2/3 

/ \ " " 

p(x) 2/2 y* 



2/i 2/2 2/3 

y( 2/2 2/3 

// n n 

vi 2/2 2/3 



, 2/2, 



(580) 



The Wronskian W(y\^ yz, #3) does not vanish since y\, 2/2, 2/a are linearly independent. 
Hence 



.() 



i, 2/2, 



and in general 



r^ - 

Ut(X) " Ja 



^dx 



,, 2/2, 1/3) 
where 2/4(2;) ?/iCr), ?/ B (^) = 2/2(x). The reader can verify that 



(5.81) 



i: 



), y.+2) 



(5.82) 



is a particular solution of (5.73). 

Example 5.14. We consider y" -f (1/J")?/' (l/x 2 }y x. It is easy to verify that 
2/i(x) x, 2/2(0;) 1 /x are linearly independent solutions of #" -f (l/x)y' (1/x 2 )?/ 0. 
We have 



v Jf 2 V / 

p(x) y' 2 (x) 


/ -x(l/'x)dx a- 2 


2/1(3) 


2/2(3) 


^ / 




1 


4 


2/i (3) 


2/i(3) 


7o 




X 








1 


__ J_ 











X* 




2/1(3) 









2/i (3) 


P(3) 


3 4 





-2/a 



A particular solution is y (x 2 /4)(x) + ( x 4 /8)(l/a:) x 8 /8. The complete solu- 



tion is y * Ax H h is- 

X O 



Problems 



1. Ly" -f ^2/' -f (I/O?/ - 0, L, 1?, C constants. Discuss the cases R* - ~ | 0. 

2. y" - y * x 

3. y"" - 22/" + y - 5 sin 2x 

4. 2/'" - y' - * + 1 
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5. y"' + y r - 2 sin x 

6. y" - 4j/' + 5j/ - e* + 4 sin x 

7. If t/i is a particular solution of y" -f p(x)y' -f ?(aOy ** nOc) and 1/2 is a particular 
solution of y" -f p(x)y r -\- q(x)y r 2 (x), show that 1/1 + ?/ 2 is a particular solution of 
y" + p(a;)y' + (*)y - n() + r*(x). 

8. y" +y ** tan a: 

9. ]/" 2#' -1- i/ cos 2 x 

10. y" - 4y' + 4y = 2xe* 

11. Solve y'" y" = 1 + x -f sin z by the variation-of -parameter method. 

6.8. Properties of Second -order Linear Differential Equations. We 
consider 



(a) If p(x) and g(x) are continuous on a ^ x ^ 6, there exists a unique 
solution y(x) of (5.83) subject to the initial conditions y(xoV 2/o, 



cit/ "ii 

y'(xo) = y' Q , a g x ^ 6. Moreover -^ is continuous since ^-~ exists. 

ctx QfX 

We now show that if y(x) is a solution of (5.83) then y(x) cannot have an 
infinite number of zeros on the interval a ^ x ^ 6 unless y(x) = 0. The 
proof is as follows: Assume y(x) vanishes infinitely often on the interval 
a ^ x ^ 6. From the Weierstrass-Bolzano theorem there exists a limit 
point c, a ^ c ^ 6. We can pick out a subsequence xi, x$, . . . , x n , 
. . . which converges to c such that y(x n ) = 0, n = 1, 2, . . . . From 
the theorem of the mean y(x n ) y(x n -i) = (x n a?n-i)y'(fm) so that 
y'(tn) = 0, Xn-i ^ {n ^ n, n = 1, 2, . . . . Hence y'(c) = lim y'({ B ) = 

n * 

since the n also approach a: = c as a limit and since y'(x) is continuous 
at x c. Moreover y(c) = lim f(x n ) = 0. But y(x) =0 satisfies 

n *w 

(5.83), and y(c) = 0, j/'(c) = 0. Since y(x) is unique, we must have 
y(x) s 0. Q.E.D. 

(6) Let 2/1 (x) and y*(x) be linearly independent solutions of (5.83). 
Let t/i(ci) = 2/1(^2) = and 2/1(3) 5^ for a ^ ci < x < c 2 ^ fe. We 
show that 1/2(0) = 0, Ci < c < c 2 , that is, the zeros of ^i(x) and y*(x) 
separate each other. Assume y<t(x) j* for c\ < x < c 2 . Then 





has no singularities for c\ ^ x ^ c 2 . Why is it true that 2/2(^1) ^ 0, 
!/2(c) 5^ 0? Moreover <p(d) = ^>(c 2 ) = 0. From the theorem of the 
mean ^>'() = for Ci ^ { g c 2 . But 
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Since yi(x) and y*(x) are linearly independent, we know that W(yi, t/ 2 ) ^ 
for a ^ x ^ 6. Hence ^'(f) cannot be zero, a contradiction. y*(x) must 
be zero for some x, Ci < x < c 2 . Q.E.D. 
(c) We can write (5.83) in the form 



<K*)V-0 (5.84) 

We multiply (5.83) by exp ^* p(t) dt} and write 

& [exp (/; p(t) dt) |] + q(x) exp (// p(0 eft) y - 



so that K(x) = exp f * p(t) dt, Q(x) = q(x) exp /"* p(t) dt. 

^ 
An important differential equation is the Sturm-Liouville equation 

- K(x) ^ + Xg(x)i/ = (5.85) 

Qjjj |_ CLX J 

where X is a parameter. Assume K(a) = K(6) = 0, arid let yi(x, XO be a 
solution of (5.85) for X = Xi, y 2 (x, X 2 ) a solution of (5.85) for X = X 2 . 
We show that 




q(x)yi(x, Xi)y 2 (x, X 2 ) dx = Xi ^ X 2 (5.86) 

We have 



(5.87) 



Multiply the first equation .of (5.87) by y 2 and the second by 2/1, and 
subtract. This yields 

dx [_ \ dx CLX I \ 

Equation (5.86) follows if we integrate (5.88) over the range a g x ^ 6. 
The orthogonality property (5.86) will be discussed in greater detail in 
Chap. 6 dealing with orthogonal polynomials. 

(d) Let yi(x) be a solution of y" + Gi(x)y = 0, and let y*(x) be a solu- 
tion of y" + G*(x)y = 0. Assume further that y^a) = ?/ 2 (a) = a > 0, 
2/i(a) = 2/ 2 (a) = ft GI < G 2 . We show that y*(x) vanishes before yi(x) 
vanishes, x > a. The proof is as follows: We have y{' + Gi(x)yi = 0, 
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2/2' + G 2 (x)2/2 = so that 

2/22/1' ~ 2/12/2' + (G\ - (72)2/12/2 = 

jz (y*y( " y&*) + ( Gi ~~ #2)^12/2 = o 

- Gi)yi(x)y t (x) dx 

- #1)2/12/2 dx 



f c d f c 

I T~ (2/22/1 2/12/D dx = I 
Ja X J a 

c ^ re 

a J a 



2/22/1 ~ 



) - a/3 + a/3 = / (G, - GJy^dx (5.89) 

Jo 

Let c be the first zero of y\(x), c > a, and assume 2/2(2;) ?^ for a g x ^ c. 
From (5.89) and 2/i(c) = we have 

2/ 2 (c)2/((c) = I C (02 - G02/12/2 dx (5.90) 



Now(? 2 (x) ~ (?i(x) > for a g x ^ c, ?/i(x) > for a ^ x < c, y 2 (x).> 
for a g x g c. Hence y<i(c)y((c) > from (5.90), so that y[(c) > 0. 
But 



^ 

a; c *^ ^ 
ar<c x<c 

since 2/1(2;) > and x < c. This is a contradiction. Q.E.D. The same 
result would have been obtained if 2/1(0) = 2/2(0) = a < 0. 

Let the reader show that, if y\(c\) = 0, y\(ci) = 0, y\(x) T for 
Ci < x < c 2 , then y^(x) = for some x on Ci g x ^ c 2 . 

Problems 

1. Consider y" + p(x)2/' + g(x)2/ = 0. Let ?/ = uv, and determine w(x) so that the 
resulting second-order differential equation in v(x) does not contain T- 

2. Consider Legendre's equation 

(1 ~ x2) S^ 2x fx + n(n + l)y = (5 * 91) 

Let yi(x, a), 2/ 2 (x, /3) be solutions of (5.91) for n =* a and n /3, a 7^ /3. Show that 

/ i t/i(a?, 0)2/2(0;, |3) dx 

3. Prove the second statement of (d). 

4. Give an example of (6). 

5.9. Differential Equations in the Complex Domain. Up to the present 
we have discussed differential equations from a real-variable view. 
Greater insight into the solutions of differential equations can be obtained 
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if we study the differential equations from a complex-variable point of 
view. The reader may recall that the Taylor-series expansion of 



/(*) = l 



1 + X 2 

about x = converges only for \x\ < 1. In a way this is puzzling since 
1/(1 + x z ) has no singularities on the real axis. However, the analytic 
continuation of /(x), namely, /(z) = 1/(1 + z 2 ), is known to have poles 
at z = i. The distance of z = i and z = i from z = is 1, so that 
the Taylor-series expansion of 1/(1 + z 2 ) about z = has a radius of 
convergence equal to unity. 

The simplest first-order linear differential equation may be considered 
without difficulty. If p(z) is analytic at z = zo, the solution of 

^ + p(z)w = (5.92) 

is w(z) = WQ exp I* p(t) dt and w(z) is analytic at z = zo. 

The first non trivial case is the linear-second order differential equation 

ft 111 fiitj 

~ + P(z} ^ + *W W = ( 5 - 93 > 

This equation is most important to the physicist and engineer. More- 
over, the methods used for solving it apply equally well to higher-order 
equations. In attempting to solve (5.93) we must obviously consider 
the coefficients p(z) and g(z). Let z = zo be a point such that p(z) and 
q(z) are analytic at z = ZQ. Remember this means that p(z) and q(z) 
are differentiate in some neighborhood of z = ZQ. A point z of this 
type is said to be an ordinary point of (5.93). Let R be the smaller of the 
two radii of convergence of the series expansions of p(z) and q(z) about 
z = z . We shall now prove Theorem 5.2. 

THEOREM 5.2. For any two complex numbers w , wj there exists a 
unique function w(z) satisfying (5.93) such that w(zo) = WQ, W'(ZQ) = w' . 
Moreover w(z) is analytic for |z z | < R. 

We begin the proof by removing the first-order derivative in (5.93). 

Let w = uv, u and v undefined as yet. Then -r- = u -y- + v -T-> 

dz dz dz 

, du dv , d 2 u 



Substituting into (5.93) yields 

. du\ dv , (d*u . du 
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If we set pu + 2 -r = 0, we see that 



u(z) 
Thus the substitution w = r exp \ -% f* p(t) dt\ reduces (5.93) to 

^ + ,/(*),' = (5.94) 

Let the reader show that J(z) is analytic for \z - z \ < It. Moreover 
the reader can also show that if v(z) is a solution of (5.94) then 

w = v exp I" -^ I* p(t) dt\ 

is a solution of (5.93). Next we attempt to reduce (5.94) to an integral 
equation. Assume v(z) satisfies (5.94) such that V(ZQ) = VQ, V'(ZQ) = v' Q . 
Then 



or -^ - V'Q = - / J(r)v(r) dr 

az J 2o 

Integrating again yields 

- vo - v' 9 (z - zo) = - I * F f J(r)v(r) dr d{ 

J 20 yo 



where ^>(f) = / JMvM dr. Note that <p(2 ) = 0. We now integrate 

Jzo 

by parts and obtain 



and () = 1*0 + P,( - 20) + f (r - 2)-/(f)''(D df (5.95) 

Jzo 

The reader can show that if v(z) satisfies (5.95) then v(z) also satisfies 
(5.94). Equation (5.95) is a Volterra integral equation of the first kind. 
It is a special case of the integral equation 

v(z) = A (z) + k(z, f)v(f) df (5.96) 



To find a solution of (5.95), we apply the method of successive approx- 
imations due to Picard (sfce Sec. 5.5). We define the sequence VQ(Z), 
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1/1(2), 
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. . . , v n (z), ... as follows: 







(r - 



(5.97) 



S( - 20) 



(f - 



- v k (z) = f z (f - 

J Zo 



Now 



so that the convergence of the sequence \v n (z) } hinges on the convergence 

00 

of the series ^0(2) + / \Vk+i(z) v k (z)]. We can write 

- w*_i(r)] df (5.98) 

We choose as our path of integration 
the straight line joining z o to z so 
that f = ZQ + t(z - 20), ^ t ^ 1, 
with f ZQ = t(z ZQ) and 

df (z ZQ) dt 

Let C be the circle of analyticity 
of J(z) with center at ZQ, and let z 
be an interior point. We construct 
a circle S with center at ZQ interior 
to C and containing the given 
point z in its interior (see Fig. 5.1). 
Since J(f) and f z are analytic 
inside and on /S, /(f)(f z) is 
bounded by some constant M , that 
is, \J(f)(f - *)\ < M for all z, f in and on S. Hence 

'(r- 




FIG. 5 1 



where M is any upper bound of v$(z) in and on *S. Similarly 
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(' ir - 2.1 i 

yo 

i* - ^oi 2 r , 

Jo ** 

By mathematical induction the reader can show that 

- v k (z)\ ~ 



Thus the series representing v n+ i(z) is bounded by the series 

n + l 

y M k \z - z,\ k 

fc-O 

which in turn is bounded by the series of constant terms /z } ^ 7 ^ since 

m*J K I 

*-0 

R > \z zo|. We know that the latter series converges to e MR so 
that from the Weierstrass M test the series representing v n +i(z) converges 
uniformly. Since each term of the series representing v n +i(z) is analytic, 
v n +i(z) converges to an analytic function t>(z) (see Prob. 7, Sec. 4.9). We 
now show that the limiting function, v(z\ satisfies (5.95). From (5.97) 

lim v n +i(z) = v Q + v' Q (z ZQ) + lim / (f 

n * n * J *o 

v(z) = ^o + ^o(z ZQ) + / lim (f 



It is possible to take the limit process inside the integral because of the 
uniform convergence of v n (z) to v(z). 

Next we prove that v(z) is unique. Let u(z) also satisfy (5.95). It is 
easy to verify that r(z) = v(z) u(z) satisfies 

r(z) = I Z (r ~ *V(f)r(r) * (5.100) 

yo 

Since w(s) and v(z) are analytic in and on S, the function r(z) is bounded 
by some constant K, |r(f)| < K. From (5.100) we have 

|r(*)| < MK ^ \df\ = MK\z - | 



206 ELEMENTS OF PUBE AND APPLIED MATHEMATICS 

Applying this inequality to (5.100) yields 



M |r(f)| |df| < M*K |f - ,| \df\ 



2! 

Continuing this process yields 



\r(z)\ < KM* 1 * 



n\ 
for all integers n. Since lim ' *' g "" <go ' = 0, we can make \r(z)\ 

n * * H ! 

as small as we please. This is possible only if r(z) s= so that u(z) s v(z). 
Q.E.D. 

Example 5.15. We consider w" zw f w =* 0. Certainly 2 is an ordinary 
point. An analytic solution is known to exist. The easiest way to find this solution 

00 

is to let w } c n z n . The c n are to be determined by the condition that w satisfy 

n-O 

w" zw' w ** 0. We have 



00 M 

W ' " A nCn2!n ~ 1 W " "" A n ^ n " 



n-l n-2 

* 00 00 

V* V* V* 

so that / n(n l)c n z n ~~ 2 / nCnZ* > c n z n 

n^2 n-l -0 

A power series in z can be identically zero only if the coefficients of z n , n - 0, 1, 2, 
. . , vanish. Hence 

(n + 2)(n + l)c n +2 - (n + l)c n =0 n - 0, 1, 2, ... 
We obtain the recursion formula 

cn+t =^2 n =0,1,2, ... 
We note that c 2 - c /2, c 4 - c 2 /4 c /(2 - 4) - c /2 2 2!, 

4 Co __ Co 

C6 * 6 "" (2-4-6) ~ 23! 
. . . , cj n c /2 n n!. Also ci Ci/3, d - c 8 /5 - d/(l -3-5), 

C( C 

C7 * "7 " (1 3 5 7) 
- ci/(2n + 1)!!, where 

(2n + 1)!! - (2n + l)(2n - l)(2n - 3) 5 - 3 1 
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The constants CQ and c\ are arbitrary constants of integration. 

of w" zw' w = is 
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The general solution 



n0 



(2n + 1)!! 



Problems 



1. Show that, if v(z) satisfies (5.95), then t>(z ) = V Q , V'(ZQ) tv Also show that 
v(z) satisfies (5.94) if v(z) satisfies (5.95). 

2. Prove (5.99) by mathematical induction. 

3. Solve w" z*w f + zw =* for w(z) in a Taylor series about 2=0. 

4. Solve w" -h 1/(1 z)w' -h IP = in a Taylor series about 2=0. 

6.10. Singular Points. Let z = a be an isolated singularity of either 
p(z) or q(z) in (5.93), and surround z =* a by a circle, C, of radius p, 
such that p(z) and g(z) are analytic for < \z a| < p. Now let zo be 
an ordinary point of (5.93), < \ZQ a| < p (see Fig. 5.2). From 
Sec. 5.9 there exists a solution of (5.93) which is analytic in a neighbor- 
hood of z . Call the solution Wi(z, ZQ). The Taylor-series expansion of 
Wi(z) = Wi(z, ZQ) about z = zo converges up to the nearest singularity of 
p(z) and q(z) 9 which may be the point z = a or a point on the circle C. 
In Fig. 5.2 C' is the circle of convergence. Now let F be any simple 
closed curve through ZQ lying inside C and surrounding z = a (see Fig. 5.2). 
We choose a point z\ on F, z\ inside C". Since w>i(z, ZQ) is defined at zi, 
we can compute Wi(z\, zo), w((zi, ZQ). Since z\ is an ordinary point of 
(5.93), there exists a unique function Wu(z) such that Wu(z) satisfies 
(5.93) and Wn(zi) = M>I(ZI, ZQ), w(i(zi) = M((ZI, z ). w u (z) is an analytic 
continuation of Wi(z 9 ZQ). Its domain 
of definition is the interior of the circle 
C" (see Fig. 5.2). This process of 
analytic continuation can be contin- 
ued, and the reader can show that 
in a finite number of such continua- 
tions we can reach z . Let w*(z) be 
the analytic function which is the ana- 
lytic continuation of Wi(z), both Wi(z) 
and w*(z) analytic in a neighborhood 
of z ZQ. We write w*(z) == Awi(z) 
and look upon A as an operator associ- 
ated with analytic continuation. We 
leave it as an exercise for the reader to 
show that, if Wi(z) and w z (z) are lin- 
early independent solutions of (5.93) for some neighborhood of z = o, 
then Ate?i(z) and Aw 2 (z) are also linearly independent. 

Since z = a is a singular point, we cannot expect necessarily that 




FIG. 5.2 
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hw\(z) SB Wi(z). Let us now attempt to find a solution w(z) of (5.93) 
such that 

Aw(z) = \w(z) (5.101) 

If Wi(z) and w^(z) are linearly independent solutions of (5.93) for a 
neighborhood of z = 20, we have 

w(z) = 



The last equation of (5.102) implies that AciWi = c\Kwi and that 



In other words, A is a linear operator. Let the reader deduce this fact. 
From (5.101) and (5.102) 



which in turn implies 

X) + c 2 a 2 i =0 



.- 
Cia 12 + c 2 (a 22 - X) = 



A nontrivial solution in ci, c 2 exists if and only if 

n - X o 

i2 a 22 X 



(g 



Equation (5.104) is a quadratic equation in X, so possesses two distinct 
roots or two equal roots. 

Case 1. Let <f>i(z), ^> 2 (z) be the functions of (5.101) corresponding to 
the distinct roots Xi, X 2 , that is, 

A^r= X,v?, i = 1, 2 (5.105) 

It is an easy task to show that <f>\ and <p z are linearly independent. 
Let us note the following: Let z a = \z a\e ie and 

(2 - a ) a = |z - alV*' 
The analytic continuation of (z a) a around F increases 6 by 2ir so that 



We choose ay, j = 1, 2, so that e** = X,, j = 1, 2. Hence 

A(* - a)i = Xy( - a)* j - 1, 2 (5.106) 
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Combining (5.105) and (5.106) yields 

A[(z - a)-^(z)] = A(z - a)-<A^(*) Why? 
= ^ (z - d)^\ j9j (z) 
= (z- a)-><pj(z) j = 1, 2 (5.107) 

Thus the functions Fj(z) = (z a)~ a ><ft(z), j = 1, 2, are single-valued 
inside C with a possible singularity at z = a. From Laurent's expan- 
sion theorem we have 



(z - a)- a ^(z) = b n (z - a)" 

n= ao 

00 

(z - d)- a *<p<L(z) = c n (z - a) n 



or 



= (2 a) a " Y c n (2 - a) n 

n= oo 

Thus both (f>\(z) and ^2(2) have, in general, a branch point and an essential 
singularity at z = a. 

Case 2. The reader is referred to MacRobert, "Functions of a Com- 
plex Variable," if \i = X 2 . In this case 

<p(z) - (z - a)" [t0,(2) + ^ In (2 - a)] (5.109) 

where Wi(z) and w^(z) are analytic at z = a. 

An important case occurs when the functions <0i(z), <p*(z) of (5.108) 
have no essential singularities. In this case we can write 



*,(*) = (z - a)"fc(*) j = 1, 2 (5.110) 

where $j(z) is analytic at z = a; #/(a) 5^ 0. Now 

= 



so that <f> 2 <f>{' - ^1^2' + p(^)(^2^{ ~ <Pi^ 2 ) = 



Thus -T- (^2^1 ^1^2) = p(z)(<f>*<P\ 

i ^w/ N (5.H2) 

j / x 1 dW(<pi, <pz) ^ ' 

and p(z) = - ^7 - r - j ' 

2 dz 
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Making use of (5.110) to (5.112), let the reader show that 

P = r + P*M 

(5.113) 






with p*(z), g*(z) analytic at z = a. The same result occurs for <p(z) of 
(5.109). We notice that p(z) of (5.113) has at most a simple pole at 
z = a and q(z) has at most a double pole at z = a. This suggests the 
following definition: 

DEFINITION 5.1. 2 = z is said to be a regular singular point of 

w" + p(z)w f + q(z)w = (5.114) 

if: 

(i) ZQ is not an ordinary point. 

(ii) (z zo)p(z) is analytic at z = 2 . 

(iii) (2 zo)*q(z) is analytic at 2 = 20- 

We now attempt to find a solution of (5.114) in the neighborhood of a 
regular singular point. To simplify matters, we assume that z = 0. 
The transformation z' = z ZQ transfers the singularity to the origin. 
Let w(z) = z a u(z), a an unknown constant, u(z) undefined as yet. For 
w = z*u we have w' = z a u'(z) + az a ~ l u, 



w " = z u" + 2az- 1 u' + a(a - 
If w(z) satisfies (5.114), then u(z) must satisfy 
z*u" + [2az + p(z)z*]u' + [a(a - 1) + azp(z) + z*q(z)]u = (5.115) 

NOW Zp(z) = PQ + piZ + P2Z 2 + ' ' ' 

z 2 q(z) = qo + q\z + q*z 2 + 
Equation (5.115) becomes ^ 

zu"(z) + P(z)u'(z) + Q(z)u(z) = (5.116) 

provided a is chosen as a root of the indicial equation 



a(a - 1) + ap Q + q - (5.117) 

with P(z) = 2a + zp(z) = 2a + p + p& + p& 2 + ' ' ' 

Q(z) = a(pi + P& + ) + 0! + q& + 

Conversely, if u(z) satisfies (5.116), then w(z) = zu(z) will satisfy (5.114) 
provided a satisfies (5.117). The existence of a solution of (5.114) hinges 
on the existence of a solution of (5.116). If u(z) satisfies (5.116), 

u(0) = w 
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then a double integration of (5.116) yields 

zu(z) = [P(0) - l]u* + [* [2 - P(r) + (T - z)(Q(r) - P'(T))]I*(T) dr 

Jo 

and u(z) = [P(0) - I]u + \ j B ( z > r MO dr * * ( 
With B(Z, r) = 2 - P(r) +(r - *)[$(T) - P'(r)] 



We leave it to the reader to show that, if u(z) satisfies (5.118), then u(z) 
satisfies (5.116). The reader can also show that if we define u(Q) = UQ 
then u(z) of (5.118) is analytic at z = so that (5.118) holds for z = 0. 
To do this, the reader must show that lim u(z) = w . 

z->0 

Now P(0) = 2a + po and a\ + a 2 = 1 po if <*\ and a 2 are the roots 
of (5.117). If i = 2 , then P(0) s 1 so that (5.118) becomes 



i / 

^ Jo 



1*00 = - B(z, r)u(r) dr (5.119) 

z Jo 

If we attempt to prove the existence of a solution of (5.119) by Picard's 
method of successive approximations, we might expect trouble at z = 0. 
Let our first approximation be UQ(Z) = uo, and define 

1 



Let the reader show that lim Ui(z) = i/ , so that, if we define 

Ui(z) becomes continuous and, indeed, analytic at z = 0. Continuing 

this process yields 



u n (z) = - \ B(z, r)Wn-i(r) dr (5.120) 

with Mn(O) = w , n = 0, 1, 2, . . . . We define u n (0) = w , and the 
reader can show that u n (z) is analytic at z = 0. Since P(0) = 1, we can 
write P(r) = 1 + rf(r) and B(z, r) = 1 - r/(r) + (r - z)[Q(r) - P'(r)]. 
We can certainly pick a small circle, C, with center at z = such that 
" \B(z, T)| < R < 2 for all 2, r inside and on the circle. Moreover 

\Ul(z) UQ(Z)\ = 



I - P(r) + (r - z)[Q(r) - P'(r)]\ dr 
-rf(r) + (r - z)[Q(r) - P'(r)]) dr 



If we let r = te, ^ < g 1, / the variable of integration, it is easy to 
see that 

\Ui(z) UQ(Z)\ < A\z\ A = constant 

for all z inside C. 
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Since \u?,(z) Ui(z)\ 



\ f 

z Jo 



T)[UI(T) - UO(T)] dr 



we have 



- ui(z)\ 



Continuing, we obtain 



AR / |r| \dr 



\u n (z) - u n -\(z)\ < A 



(ir 



Since the series A\z\ Y (R/2) n ~ 1 converges uniformly inside and on C, 

n=l 

remember R < 2, the sequence {w n (z) ) converges uniformly to an analytic 
function u(z). As was done in Sec. 5.9 for z = o an ordinary point, it 
can be shown that u(z) satisfies (5.116) and (5.115). It is then trivial 
to show that w(z) = z*u(z) satisfies (5.114). 

Example 5.16. zw" + w' w = or w" + (l/z)w f (1/2)10 = 0. In this case 
p(z) 1/z, q(z) l/z and zp(z) = 1, z 2 q(z) z so that z = is a regular 
singular point. We know that a solution exists in the form w z a u(z), where a 
satisfies a quadratic equation and u(z) is analytic at z =0. One of the simplest ways 

00 00 

of determining u(z) is to assume u(z) } c r ,z n , w(z) } c n z n+a . The series for 

n=0 n=0 

w(z] is substituted into zw" -j- w' w = 0, and the coefficient of each power of z is 
equated to zero More specifically, we have 

no oo 

w'(z) - Y c n (n + a)z+ a - 1 w"(z] Y c n (n + a)(n -fa- l)z n + a -* 
n=0 n=0 

00 ^ 

and hence w(z) y c n z n+a satisfies zw" -f w' 10 = if and only if 
n = 

00 00 00 

Y c n (n + )(n + - D^^- 1 + Y c n (n + a)z n+a ~ l - Y c n z n+a (5.121) 

n=0 n=0 n=0 

The lowest power of z which occurs in (5.121) is z a ~ l . The coefficient of z a ~ l is 
coot(a 1) -1- c a s c [a(a 1) + ]. If (5.121) is to be satisfied, we must have 

C [a(a - 1) + ] - 

Co is to be arbitrary so that a must satisfy the quadratic (indicial) equation 

a(a - 1) -f - 
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or a 2 = 0. The roots of the indicial equation are a = 0, 0. The coefficient of z n+a ~ l 
m (5.121) is 

c n (n + a)(n +-!)+ Cn (n + a) - C H -I n - 1, 2, 3, . . . 
If (5.121) is to be satisfied, we must have 

c n (n + a)(n +-!)+ c n (n + a) - c n -i - 
or c n - (^qf^yi n = 1, 2, 3, ... (5.122) 

Equation (5.122) is the recursion formula for the c n , n 1, 2, . . . . Since a 0, we 
have 

c n = ^ n - 1, 2, . . . 

., , _ Co _ Ci _ Co _ C 2 _ Co _ Co 

so that Ci - C 2 - - - c 3 - 7 



(37)2 
and by induction c n = c /(n!) 2 . Hence a solution of zw" + 10' 10 = is 



w(z) = A ( Co = A 

n = 

We postpone the discussion for finding a second independent solution of 

zw" -\- w' w 

00 

Zz n 
, , NO converges for ^ |z| < . This is exactly the 
(n!) z 
n = 

region of analyticity of zp(z) 1, 2 2 g(2;) = z. 

The reader can find a proof in Copson, " Theory of Functions of a 
Complex Variable," that, if zp(z) and z 2 q(z) are analytic for \z\ < R, 

00 

then w" + p(z)w' + q(z)w = has a solution w(z) = ^ c n z n+a and the 

n = 

00 

series N c n 2 w converges at least for |z| < /J. The roots of the indicial 

n = 

eguation need not be equal. 

Example 5.17. zw" + (z l)w' + w = 0. The reader can easily note that z 

00 

is a regular singular point. Let w(z) / Cn^* 1 " 1 "", so that 



Cn (n -h a)^"^- 1 

n-0 

00 

w"(z) V c n (n -f a)(n + a - l)z w + a ~ 2 . Substituting into 



n-0 
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zw" + (z - l)w' + w - 
yields 

oo 

c n (n + )( n + - 2)z+-i + Y c n (n + <* + l)* n + a - (5.123) 



The smallest power of z occurring in (5.123) is z*" 1 , whose coefficient must be equated 
to zero. Thus 

ro(a - 2) - 

and a satisfies the indicial equation a(a 2) =0, so that a 0, 2. We shall see 
that the larger of these two roots produces no difficulty, the smaller of these two roots 
doGH produce a difficulty. 

The coefficient of z n+a ~ l t n = 1, 2, . . . , in (5.123) must he sot equal to zero. This 
leads to the recursion formula 

c(n + )(n + a - 2) + r n -i(n + ) - 
C =-n^~2 "-1,2,3,... 

If we try a = 0, we have c n c n ~\/(n 2) and C2 becomes meaningless. We try 
a 2 and obtain r n = r w _i/n, so that c\ = c , r 2 = n/2 = r /2, 



Ct/4 = Co/4!, and by induction r w ( I) n (c /w!). Hence 



(-!). = Az*e~* 



is a solution of zw" + (z I)t0' H- to 0. 

It is not very difficult to see that a second solution cannot be obtained 
by the method of series solution used in Examples 5.16 and 5.17 when 
the roots of the indicial equation differ by an integer. When the roots 
do differ by an integer the larger of the two roots presents no difficulty. 

ao 

Let w(z) = y c n z n+a be a solution of (5.114). Then 

n=0 

00 00 

w'(z) = c ( n + <x)* n+ ~ l w"(z) = y c n (n + a)(n + a - l)z n+a ~* 



n=0 



and substituting into (5.114) yields 



c n (n + a)(n + a l)*+*~ 2 _|_ pfy 

n^O n^O 



+ q(z) 2^ c n 2+ = (5.124) 
= o 
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Since zp(z) = po + piz + p&* + - , z 2 q(z) = q + q& + q& 2 + - , 
we have 



[c n (n + a)(n + -!)+ c n (n + a)(p + Pi + ) 

+ Cn(?0 + $!*+ O^- 2 = 

Of necessity 

a(a 1) + ap + <?o = indicial equation 

The coefficient of 2 n +- 2 must be zero, which implies 

c n [(n + a)(n + a - 1) + (n + a)p + go] 

n-l 

c.[( + a)p n -. + g-J n ^ 1 (5.125) 



Equation (5.125), the recursion formula for the c n , will determine c w , 
n = 1, 2, . . . , in terms of Co unless 

F(a, n) zz (n + a)(n + a - 1) + (n + a)p<> + q = 
= (n + a)(n + a. - 1 + p ) + ^o = 

for some integer n. If QJ, i are the roots of the indicial equation, a ^ i, 
we have a + i = 1 po, ai = ?o, so that 



F(a, n) = (n + a) (ft <*i) 
= n(n + a i) 

Since a ^ ai, F(a, n) 5^ f or n ^ 1. The only difficulty would occur if 
a ai = negative integer. To obtain a second solution when the roots 
of the indicial equation differ by an integer, we turn to the method of 
Frobenius. 

Method of Frobenius. We use (5.125) to determine ci, 02, . . . , c n , 
... in terms of Co and a. The specific value of a is not inserted. The 
series 



10(2, ) = c ( a ) 2 " + ( 5 - 126 ) 

n-0 

determined from (5.125) does not satisfy (5.114) but satisfies 

d*w(z, a) , , x dw(z. a) , / x / x 
^ + P(z) dz + ?(*)(, ) 

= C (a - i)(a - a 2 )2 a ~ 2 (5.127) 

where a\ and 2 are the roots of the indicial equation. The c n (a) of 
(5.126) are well defined for the larger of the two roots, say, a\. Now let 



216 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

Co = A (a <* 2 ) so that (5.127) becomes 
z, a) , , N dw(z, a) 



to dz 

= A(a - ax)(a - atfz~* (5.128) 

The change of c to A (a a 2 ) applies also to the coefficients ci, c 2 , . . . 
since these coefficients depend on Co. If i = 2, it is not necessary to 
replace Co by A (a 2 ). We differentiate (5.128) with respect to a. 
Since a and z are independent variables, we have 



il _ * - 

Evaluating at a = 2 yields 

_ f _JfM _|_ p(%) ( ) + q(z) ( ) = (5.129) 



( ) is 

\OQL / a^a-i 



Equation (5.129) shows that ( J is a solution of (5.114). 
Example 5.18. Returning to Example 5.16, we have 

Co Ci Co 

Cl ~ (1 -f a) 2 ' C2 = (2 + a) 2 ~ (1 -f a) 2 (2 + a) 2> ' ' ' ' 

Co 



Cn (1 -f a) 2 (2 "+)"" . ." . (n -f a) 2 ' 
so that 



*, a) 



+ ?! + . . .1 

^ (1 H- ) 2 (2 + ) 2 (n + a) 2 ^ J 
To compute (T J we need to compute 

AT i 1 

da 1(1 + a) 2 (2 + a) 2 (n + a) 2 J 
Let y - (1 + a)-*(2 -f a)-* - (n -f <*)~ 2 so that 

In y - -2 Y In (* + ) and - & = -5 

j y dot 



where F(n) 
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1 



n=0 

The second solution of zw" + w' w = is 



Example 5.19. Referring to Example 5.17, we have 

Co _ Ci _ Co _ C2 

Cl * ~ = C2 ~ ~ ~ - ' 3 



- l)a(a + 1)' 



" n (a - !)( + 1) ... (a + n - 2) 
We replace TO by Aa and obtain 

[y2 y3 

a ^ r Z H T 7 -__-__ -|- . . . 
a 1 a 1 (a l)(a + 1) 

+ ( ~ 1)n (^=T)Ti "+"!)( +2) ... (a" + ^^2) + ' ' J 
To compute ( ) > we need to compute 

\OCK / a0 

AT l 1 w , 2 

da L( -"!)( + !)( + 2) - - (a + n - 2)J a . 
Let y - (a - 1)-K + l)' 1 ^ + 2)" 1 ...(+ n - 2)- 1 , so that 

n-2 

In y - In ( - 1) - Y In (a + k) 



n-2 n-2 

dy 



_ kj (n -2)1 
If n = 3, ( ^ ) =0. Hence 



n-2 



w =4 

The second solution of zw" -f (z 1) w f + w = is 



w(z) - 5 2 6-* In 2 - 1 - z + z* - z* 
where F(n) -(-!)" 
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Problems 

1. Derive (5.113). 

2. Show that if u(z) satisfies (5.118) then u(z) satisfies (5.116). 
8. Derive (5.125). 

4. Solve zw" w 0. 



S z n+l 
nl(n + 1)! 

N-0 

00 

[V z n+l 
ln Z 2, nl(n + 



N-0 

00 

2 1 



_ 1!2! 

n-O 

. JL ( 2 + 2 + n + . . 1 

2!3!\1 2 ^3/ ^ ] 

5. Solve 2 2 (1 z)w" -f- 2(1 Zz)w' w - for 10(2) in the neighborhood of 

2=0. 

6. Solve z(l z)w" (1 + z)w f H- w for 10(2) in the neighborhood of 2 0. 

7. Solve 4 8 ip / ' + 4zw f ( J + l)w for if;(2) in the neighborhood of 2 0. 

5.11. The Point at Infinity and the Hypergeometric Function, To 
determine whether the point at infinity is an ordinary or regular singular 
point, we let z = l/t and investigate the point t = 0. We have 



dw _ dwdt^ __ . 2 dw 
Ih ~ llidz ~ ~ ~di 



= /4 _^ + 2p 
az* dt 2 dt 

so that w" + p(z)w' + q(z)w = becomes 

Hence z = oo is an ordinary "point if 

alytic at t = 

(5.131) 



(i) 2i is analytic at t = 



(ii) -4 q ( - J is analytic at / = 



(i 
For this case a solution can be found in the form 



Again z = oo is a regular singular point if z oo is not an ordinary 
point and if 



1 l\ 

(iv) 72 # \ 7 I 
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(5 ' 132) 



(iii) 2t ~ is analytic at / - 



MO- 



is analytic at t = 
* v / 

This implies 

1 /A 

+ Pit + pzt* + ' ' 



othat 



If, moreover, we desire that the origin also be a regular singular point 
we must have p(z) = po/z, <?(z) = <?o/z 2 . 

Example 5.20. We look for the differential equation (of second order) which has 
but one regular singular point at z = 0. In this case z * is an ordinary point so 

that 



and n(*> - 2 - _ g + . . . 

VW-S+S+--- 

In order that 2 = be a regular singular point, we must have 

PO = Pi = ' ' = P = - 00 - 01 = ' - tfn -- 

Hence g(z) s 0, and p(z) = 2/2, so that 



We consider now the following problem: We look for the differential 
equation which has exactly three regular singular points at z = 0, 1, ; 
all other points are to be ordinary points. Since p(z) has at most simple 
poles at z = 0, z = 1, we know that z(l z)p(z) is an entire function. 
Hence 



2(1 - 2) P (2) - " (5.133) 

n-0 

for all z. Since z = is to be a regular singular point, condition (iii) 
must be upheld. From (5.133) 
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(0 



t - 



y x 

MO; 

must be analytic at t 0. This is possible if and only if a n = 0, n > 1. 
Thus z(l 2)p(z) = a + ai-s 

and p(z) = - + r-A_ (5.134) 

# l <s 

With the same type of reasoning the reader can show that 



Let the roots of the indicial equation at z = <*> be a = a, a == 6. 
We also impose the condition that the indicial equations at z = and 
2 = 1 have at least one root equal to zero. Now at z = we have 

7>o = lim zp(z) = c from (5.134). Also q<> = lim z 2 q(z) = pvB. The 
*-o s-o 

indicial equation at 2 = is 

a(a - 1) + ca + \ivB = 

If a = is to be a root of this quadratic, we must have iivB so that 
we pick \i = if we desire q(z) ^ 0. The other root is a = 1 c. Let 
the reader show that, if one of the roots of the indicial equation at z = 1 
is zero, the other is a = 1 + A and 

M - i^i) (5 - 136) 

At z = oo we have 

2t - p(l/t) 2t - ct - tA/(t - 1) 2 - c - A/(t - 1) 



t* t 



and po = lim t 2t ~ = 2 - c + A 

t-^Q I 

Similarly g = B, and the indicial equation at z = oo is 

a(a - 1) + (2 - c + A)a + B = 
or a 2 + (1 - c + A)a + B = 

If the roots of this equation are a and 6, we must have 

a + b -1 +c- A 
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ab = B, so that A=a + b + l c. Our differential equation is 

d?w (c l-c + a + b\ dw ab _ 

dz* \z z 1 / dz z(z 1) v 

Equation (5.137) is the famous hypergeometric differential equation. 
Its solution is written in the form (after Paperitz) 

( oo 1 \ 

w(z) = P I a z\ (5.138) 

\1 c 6 c a b } 

The top row denotes the regular singular points, and the other rows 
contain the roots of the indicial equations at each singular point. Note 
that the sum of the roots is unity. 

The operator 6 = z-r- due to Boole is very useful in solving (5.137) 

by series methods. The reader can show that 9 has the following 
properties, 



e n (z<f>(z)) = z a (e + a)v 

n 

where 6V = B(O^), etc., and (0 + )V = /^ {) a n ~ r BV. 

r = 

The reader can show that (5.137) can be written in the form 

6(6 + c - l)w = z(6 + a) (8 + b)w (5.140) 

We solve (5.137) or (5.140) by series. Let 



W = > Cfc*+ a 



Then Qw = c*6z*+ = c*(fc + a)z k + 

*=o fc=o 

(6 + V)w = 7 c fc (fc + a + b)z k + 
ffo 

00 

9(0 + 6)w = Y c k (k + a + b)(k + a)z k + 
*% 

00 

z(B + a) (6 + b)w = y c*(fc + a + b)(k + a + a)2^ +1 (5.141) 
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Also 9(9 + c - l)w = c *(k + <**)(k + a + c - l)z k+a (5.142) 

ib-O 

Equating (5.141) and (5.142), we obtain the recursion formula 

c*+i(/c + a + l)(fc + a + c) = c k (k + a + a)(k + a + b) 
along with the indicial equation 

a(a + c - 1) = 
For a = we have 

(k + a)(k + b) , 



^i x 

so that Ci = ^ Co 

1 c 

(a + 1)(6 + 1) (a + l)q(6 + 1)6 Co 
C2 2(c +1) (c + l)c 2! 

and, in general, 

( q + ^ ~ 1) * * * a(b + n 1 ) b C 
Cn (c + n - TT 7 " 7 " 7 ^ 



_ 4 r(o + n)r(b + n) _^ 
~ T(c + n) w! 

where F(a) is the gamma, or factorial, function of Chap. 4. A formal 
solution of (5.137) is 

w(z) - A V r(a + n)F(6 + n) 2n T 

W(Z} ~ A Li T(fTnT~ ^ ( 

n-O 

a, 6, c cannot be negative integers. Why? 

We define the hypergeometric function F(a, 6; c; z) by the equation 



n)T(b + n) i" , K . AA . 

, r(c + ) ^ (5J44) 



Problems 

1. Solve w?" + 2w' = for 10(2) in the neighborhood of z ~ 0. 

2. Derive (5.135). 

8* Let w(z) be a solution of Legendre's differential equation 



DIFFERENTIAL EQUATIONS 223 

Show that 

(1 -1 oo I 

w(z) - P {0 n -f 1 z\ 
(O -n | 
4. Verify (5.139). 
6. If c is not an integer, show that a second solution of (5.137) is 

/ x ,- c \ T(n + 1 + a - c)r(w 4- 1 + 6 - c) z n 
w(z) ~ 



T(n + 2 - c) n\ 

n-0 



6. What is the interval of convergence of (5.143)? 

7. Show that 



by expanding the left-hand sides in Taylor-series expansions. 

8. Show that when c = 1 the second independent solution of (5.137) is 



r(a)r(b)F(a, b; 1; ,) In , 

r = l 

, V / _ 1 , _ 1 2\ 

where s r / [ ; -- 7 -f 7 ; --- 7 I 
Li \a 4- n 1 b + n 1 n) 

n=*l 

5.12. The Confluence of Singularities. Laguerre Polynomials. In 

the discussion of the hypergeometric function and its associated differ- 
ential equation the regular singular points 0, 1, were distinct. It 
turns out to be useful to consider what happens when two or more 
regular singular points approach coincidence, a process usually referred 
to as the " confluence " of singularities. The reader is referred to Chap. 
XX of Ince's text on differential equations for more details and for a very 
useful classification of linear second-order differential equations accord- 
ing to the number, nature, and genesis of their singularities by confluence. 
We consider Example 5.21 by way of illustration. 

Example 5.21. Rummer's Confluent Hypergeometric Function. Consider the hyper- 
geometric function 

w(z) - *Fi(a, b; c; z) 

+ 1) ' ' (o + * - 1)W + 1) (6 + n - 1) af , fi 
c(c + 1) - - - (c 4- n - 1) - n\ (5 ' 
n0 

for \z\ < 1, which satisfies 

2(1 - z)w" + [c - (a -f b -f l)z]w' - abw - (5.146) 
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Now let z' = bz, and then replace z' by z. We obtain 



fa, b; c; g 



fc + 1) (q + n - l)b(b + 1) - (6 + n - 1) g ,.-.- 
c(c -f 1) (c + n - 1)6 SI (6 * 147) 

n = 

which converges for \z\ < 6. 2^1 satisfies 

* (l - Q w" + (c - a + fr + 1 g) tf - aw - (5.148) 

If we let 6 > oo , we obtain (formally) the series 

F ,. . A _ V q(q + 1) (fl + n - 1) 2" ,. 14Q , 

fi(a,c,*) - c(c + 1} . . . (c + w . 1} -, ( 5 - 149 ) 

n = 

which certainly converges for |z| < 1. Moreover it can be shown that iFi(a; c; 2) 
satisfies 

zw" + (c - z)w' - aw - (5.150) 

obtained formally by letting fr oo . Equation (5.150) is Kummer's confluent hyper- 
geometric equation. It has a regular singular point at z = 0, but z = is an 
irregular point, that is, z = oo is neither an ordinary point nor a regular singular point. 
If c is not an integer, the other solution of (5.150) is 

z l ~ f iFi(a - c + I ; 2 - r; z) (5.151) 

If r is a negative integer or zero, (5.149) becomes meaningless, while if c is a positive 
integer greater than 1, (5.151) becomes meaningless. If c = 1, both solutions coin- 
cide. The second solution can then be obtained by the method of Frobenius. 

We now consider the function iFi(a; c + 1 ; z). The series representa- 
tion of iFi(a; c + 1; z) will terminate if a is a negative integer, w, for 
the coefficient of z m+1 is 



(c + l)(c + 2) - (c 

Similarly higher powers of z have zero coefficients. Hence iFi( n, c + 1 ; 
2) is a polynomial of degree n if n is a positive integer. We define 

co (c + D(c + 2) (c + n) iFi{ _ n . c + 1; g) 



/C" I 



as the associated Laguerre polynomial, n a positive integer. The reader 
can show that 

p + fr.M-oSa+.vw.. (6 . 153 ) 
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We show now that we can write 

n 

(e-*z+<) (5.154) 



Let v(z) = e-*z n+c so that ~ = <r\(n + c)z n+c ~ l - z n+c ] and 

dv / i \ 

z j- = (n -f c z)v 

Differentiating n + 1 times with respect to z yields 



d n v 
If we let u = T~ > we have 



l " C) + (n + 1}l " = (5 * 155) 

d n 
Now let w(z) = e^~ c w(g) = e z z~ -^ (e~ z z n+c ). If n is a positive integer, 

it is easy to see that w(z) is a polynomial of degree n. From 

u(z) = w(z)e~ z z c 

and the fact that u(z) satisfies (5.155) let the reader show that 
satisfies 



+ (c + l " 0) + nw = (5>156) 



Since (5.156) is exactly the same as (5.153), we must have 

Lf(z) = Kw(z) = Ke*z~ c ~ (e~*z n+c ) 
Let z = 0. From (5.152) the reader can easily show that 

norm - (c + 1)(c + 2) (* + n) 

- - ~ 



Let the reader show also that 



e l z~ 



2 = 



= (c + D(c + 2) (c + n) 



1 
n\ 



so that K = This proves (5.154). 
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The associated Laguerre polynomials have the interesting and impor- 
tant property 



e-*x c L%(x)U*(x) dx = m ^ n (5.157) 

To prove (5.157), we need only show that 

f* er x x c x m D n c) (x) dx = f or m < n (5.158) 

If (5.158) holds, then (5.157) is valid, for L*(x) is a polynomial of degree 
m < n and (5.157) is simply a linear sum of integrals of the type (5.158). 
Equation (5.158) can be obtained by integrating by parts m times after 
replacing U(x) in (5.158) by (5.154). 

Problems 

1. Show that iFi(a; c; z) satisfies (5.150). 

2. Show that L ( n c) (*) satisfies (5.153). 



3. Show that 14- (0) - 

4. Prove (5.158). 
6. Show that 



r(c) dx _ T(c + n + 1) 

Hint: Evaluate /_ e~*x f x n L% ) (x) dx. 
6. Show that 

n>L^ } (z) (2n + c 1 - zJL^^z) (n -\- c 

n ^ 2 



/. 
/.- 



W dx - WW -LSIM 
^ 0) is called the ordinary Laguerre polynomial. 
7. Show that L(z) 



r-O 



5.13. Laplace's Equation. We discuss now a method for solving 
Laplace's equation. This equation is of some importance in mathe- 
matical physics. We consider V 2 F = in rectangular coordinates, 



(5.159) 

V/.C- ^^i/ VXfc> 

Let us look for a solution in the form 

V(x, y, z) = X(x)Y(y)Z(z) (5.160) 



DIFFERENTIAL EQUATIONS 227 

This attempt at finding a solution of (5.159) is called the method of 
separation of variables. Applying (5.160) to (5.159) yields 



x ^ = 

For V 7* we have upon division by V 

1 d*X 1 d 2 F 1 d*Z 
X dx* + Y dy* + Z dz* 

ld*X ld*Y ld*Z 
Thus -J^ == YW + Zd^ ( 

The equality in (5.161) cannot hold unless both sides of (5.161) are con- 

flu 
starit, for if u(x) = v(y, z), then = and u SE constant. Hence 



. . , M 

~ V-J-T = constant = A- 
X ax 2 

or jj^ k*X = (5.162) 

The solutions of (5.162) are 

X = A k cos kx + B k sin kx 

or X = A k e k * + B k e~ kx (5.163) 

or X = Ax + B if k = 

Similarly 



and Y = Ci cos ly + DI sin /?/ 

or Y = Cic^ + jDic-^ (5.164) 

or F * Cy + D if Z = 
Finally 



or ^f + ( + fc 2 T / 2 )Z = (5,165) 



The solution of (5.165) depends on the magnitudes of k 2 and I 2 and the 
signs preceding k 2 and Z 2 . 

The choice of the sign preceding the constants depends on the nature 
of the physical problem. We illustrate with Example 5.22. 

Example 5.22. For steady-state heat flow we have V*T 0. We consider a two- 
dimensional semi-infinite slab of width o. The edge given by x 0, Q y < ois 
kept at constant temperature T 0, as is the edge given by a: a, # < co. 
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The base of the slab given by y 0, ^ x ^ a is kept at a steady temperature given 
by T /(x). For the steady-state case we have 

d*T ,d*T_ n 
a? "ay* ~ 

If we let 2T - ^"(a:)F(j/), we obtain as above 

1 d*X ld*Y 



If we choose our constant to be negative, we have X Ake kr + Bke~ kx . Our bound- 
ary condition necessitates, however, that T when x = and when x = a for all y. 
This cannot be achieved for X Ake kx -j- BkC~ kx unless Ak Bk 0. However, if 
we choose 



_-= 
X dx* 

then X = A k cos for -f #* sin kx. Since sin (rnrx/a) vanishes for x = and x 
provided n is an integer, it seems proper to consider 



v n . nirx 
X = B n sin - 
a 

where fc = rwr/a. It follows that 

d*Y _ nV 

dy* a* * ~ U 

so that Y = C n e (nir/a)v + D n e~ (nv/a}v . We do not expect the temperature to become 
infinite as t/ becomes infinite ; so we choose C n * 0. 



Hence r(or, y) = B n D*e-<*'** sin ~ 



(5.166) 



where n is an integer. The final boundary condition is T(x, 0) =/(z). Equation 
(5.166) cannot, in general, satisfy this boundary condition, unless, by choice, we pick 
(x) A sin (nirx /a). Now V*T is a linear partial differential equation, and it is 
an easy thing to show that 

00 

T(x, y) = T A n e-< n */ a)v sin (5.167) 

^ a 

n=0 

is also a solution of V 2 !T provided the series converges and can be differentiated 
twice term by term. To satisfy the boundary condition T(x, 0) f(x), we need 

00 

f(x) = Y ^sin (5.168) 

L* a 

n=0 

Can the infinite set of constants A n , n - 0, 1, 2, . . . , be found so that (5.168) holds? 
This is the subject of Fourier series and will be discussed in Chap. 6. 
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Problem 1. Find a solution of V 2 F - such that V - for x - 0, x - o, V - 
for y = 0, y 6, F for z -f . 

Problem 2. By the method of separation of variables find a solution of 

d*V a*F 67 

dx 2 "*" ay 2 a* 

Assume 7(s, y, - X(x)Y(y)T(t). 
Problem 3. Do the same for ~^ + ^ - 4 ^?- 

as 2 ay 2 c 2 a* 2 

We find now a solution of V 2 F = in spherical coordinates by use of 
the method of separation of variables. In spherical coordinates Laplace's 
equation is 



' 3V I .4- _ I , R _ I J. - 

I -4- I ait, /I I -L- ___ 

Assuming F(r, 0, <p) = JB(r)9(0)^(^), we obtain upon division by F 

sin 2 d ( 2 d#\ . sin rf / . _ c?6\ 1 
p TT~ \ ' "^7~ ) ~l I sm ^ ) = 

\ / \ twv J *tr 

Let the reader show that of necessity 



T-J = constant 
Now physically we expect and desire that 

V(r, B, <p) = 7(r, 0, ? + 2r) 

Let the reader show that this implies that the constant be chosen as the 
square of an integer, that is, 



Thus $> = A n cos n<p + B n sin n<p. Equation (5.169) now becomes 
1 d ( 2 dfl\ 1 d { . . de\ , n 2 /c 

fi 5 V* ^J = " eSO 3 V sm ' 3 j + SETS ( 5 

and this implies that 

1 d ( , dR\ . . . 

RTr( r 2 ^TJ = co^tant = fc 

r^ + 2r^-*.0 (5.171) 

To solve (5.171), we assume a solution in the form JB(r) = r m . This 
yields r m [m(m - 1) + 2m - fc] = 0. Hence r m is a solution of (5,171) 



230 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

provided m(m + 1) = fc. Similarly r- (m+1) is a solution of (5.171) for 
k = m(m + 1). The most general solution of (5.171) for k = m(m + 1) 
is 

R(r) = C m r- + D m r-<+ (5.172) 

We need not have guessed at a solution of (5.171), for it is immediately 
obvious that r = is a regular singular point of (5.171). The series 
method of Sec. 5.10 could be used to obtain (5.172). Equation (5.170) 
now becomes 

sin 6 ~ (sin 6 ^J + [m(m + 1) sin 2 6 - n 2 ]9 = (5.173) 
If we let fji = cos 0, dp = sin d6, (5.173) can be written 




~ M2) : + [m(m 

(1 - M 2 ) - 2^0 + [m(m + 1) - J-5L-J 9 - (5.174) 

Equation (5.174) is called the associated Legendre differential equation. 
Its solution in Paperitz's notation is 

oo 1 \ 

(5.175) 



Problem 4. Deduce (5.175). 

Problem 6. If V is independent of <?, that is, n = 0, (5.174) becomes 

(1 - ) - 2 M + m(w H- 1)P = (5.176) 

with 6 replaced by P. For m an integer show that the solution of (5.176) in the 
neighborhood of n 0, written P,(M), is a polynomial of degree m. Also show that 

^-0*)P(M) ^M - for m 5* n 

The P m (/*), m 0, 1, 2, . . . , are called Legendre polynomials. We shall have a 
great deal more to say about such polynomials in Chap. 6. A solution of V 2 F for 

V - F(r, 0) is 

00 

V(r, B) - y (A m r + B,,r-<-+'>)P, (cos 9) (5.177) 



We have seen that the solution of Laplace's equation in spherical 
coordinates led to Legendre polynomials. The solution of Laplace's 
equation in cylindrical coordinates yields the Bessel function. In cylin- 
drical coordinates V 2 F becomes 
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3*F . ,_ ,_ 

W = Q (5 ' 178) 

Assuming V(r, 6, z) = R(r)Q(B)Z(z) yields 

(5179) 
l *' 



_ 
Rdr\ dr Z dz* 9 

If we desire F(r, 0, 2;) = F(r, + 2ir, z), we need 



2 
9 d02- y y an integer 

so that B(^) = A, cos ^ + B v sin ^ (5.180) 

Similarly 

1 d * z / 2 7 i 

7 TT ~ ^ ^ rea * or a P ure imaginary 

so that 

Z() = C**- + D*e-*' (5.181) 

If fc is real, Z(^) is exponential, and if k is a pure imaginary, Z(z) is 
trigonometric. Equation (5.179) now becomes 



If we let z = fcr, /? = w, (5.182) becomes 



This is Bessel's differential equation. We see that z = is a regular 
singular point. 

Problem 6. Solve (5.182) for k and v an integer. 

Problem 7. Solve (5.183) for the cases v = integer, v ^ integer. Hint: Let 

z 1 - 4s, 6 x -T-, and write (5.183) as 

(02 - -J-,,2) w + xw a 
oe 

Let 10 = x* ^ o r x r , and show that 

(~x) r 



Wi(x) Ax v/ ' 

r-O 

If p is not an integer, a second solution is 
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We define the Bessel function of the first kind of order v to be 






An important recurrence formula for the J r (z) is 



, +l (z) = J,(z) (5.185) 



To prove (5.185), we note that 

/A" 1 V (-l)-(s/2)* 
^ LI r!l> + r) 

r-O 

rv (-: 



LI r\T(v + r + 2) 

oo 

\ \ Z / 4j 



r=0 r-O 



V 



+ r"+ \) 

r=l 

1 



^2/ IT(v) ' Ll r\T(i> + r + 1) \ 4 
However, 

ao 

V (-2 2 /4)-- 

+ 1) + Li r\T(v + r + 1) 



?! j (g) _ 2j; /*v r i 

z Jr(Z) - z W lT(v 



^ r 

2J L 



4. v 



r-1 

Equation (5.185) is seen to hold since v/T(v + 1) = 
Problem 8. Show that 



/,_,(*) - J,(g) - 2J' t (z) (5.186) 
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Problem 9. From (5.185) and (5.186) show that 



\J v (z) -J' v (z) = 



Problem 10. Show that 



(2 \i 
j sinz 

Problem 11. Show that, if a 7* ft, v > 1, 

(' - 0) J o V,(rf)J,(0) dl = X [j v (ax) ^^1 _ / 
2a 2 f*t[J,(at)]*dt - (* - ^ 2 )[/,(ax)] 2 -f 

Problem 12. From the results of Prob. 11 show that 

tf,(at)J,(0t) dt - 
provided J(a) J(ft) 0, a 5^ j8, ^ > 1. Also show that for this case 
f*t[J,(at)]*dt =[/v+i()P 

5.14. Hermite Polynomials. The differential equation 

$ -,* + (5.187) 

arises in quantum mechanics in connection with the one-dimensional 
harmonic oscillator. We shall see that the Hermite polynomials defined 
by 

H n (t) = (~l)VV2^g!_ 2 (5 . 188) 

satisfy (5.187) for any nonnegative integer n. The reader can easily 
verify that H n (t] is a polynomial of degree n. From (5.188) we have 






so that = n^ B _i(<) (5.189) 

Problem 1. Show that 

(5.190) 
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Problem 2. From (5.189), (5.190) show that ~? - t f + nH n - 0. 
From (5.189) we have 



and, in general, 



d r H n (f) f 1N , , 

d P = n(n - 1) (w - r + 



and =_H_ (0 (5.191) 

From Taylor's expansion theorem arid using (5.191) we obtain 



^n- 2 C)"~- 



We now look for a generating function <p(z, i) in the sense that 

00 

v(z, t) = //(/) (5.193) 



If v?(^, can be found, then the Taylor-series expansion of <p(z, t) in 
powers of z will yield the Hermite polynomials. If <p(z, f) does exist, 
then formally we have 



assuming that one can differentiate term by term. From (5.189) we have 



n=l n-1 



Integration yields 

^>(z, t) = 

where M(Z) is an arbitrary function of integration. Thus 

I _|_ p'(z)] 

(5.194) 
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From (5.193) we have formally 



n=l n=0 



- V if m z " 

~ Li //B+2(0 JH 



so that ~ = tJ/ " +2( ~ tHn+l(i)] 



5 



w-0 



00 

- <n 

n-0 



+ !)#) rr 



eta 

= 2 - (p 

Making use of (5.194) yields 

M "(z) + [M'(*)l 2 + (^ + f)fjL f (z) + zt = -1 (5.195) 

Since z and t are independent, we find that p(z) must satisfy // + z = 0. 
Now /i(z) = 2 2 /2 satisfies (5.195). The function 

^(2, /) s <-/*+* (5.196) 

can be shown to be the generating function for the H n (t). There is no 
trouble in differentiating the series expansion of <p(z, t) term by term 
since <p(z, t) is analytic for all 2, t in the complex domain. 
Finally, we show that 

JS Jjt if m 7* n \ 

t}dt = n\ ifm = n ( 5 ' 197 ) 



We consider 

1=1 e-^H^HnW dt 
J -*> 

/* /7mp-V 

^C-D-H-W^r- 

Integration by parts jdelds 



f^oo rf< 

(-l)-i n / H n -i(t)~^^-dt Why? 

J tO wW 
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Further integration by parts yields 






/* 
^ 



dt 

If m > n, let the reader deduce that 7 = 0. If m = n, 
/ = n! 

REFERENCES 

Agnew, R. P.: " Differential Equations," McGraw-Hill Book Company, Inc., New 

York, 1942. 
Bateman, H.: " Partial Differential Equations of Mathematical Physics," Dover 

Publications, New York, 1944. 

Cohen, A.: " Differential Equations," D. C. Heath and Company, Boston, 1933. 
Goursat, E.: "Differential Equations," Gmn & Company, Boston, 1917. 
Ince, E. L.: "Ordinary Differential Equations," Dover Publications, New York, 

1944. 
Kells, L. M.: "Elementary Differential Equations," McGraw-Hill Book Company, 

Inc., New York, 1947. 
Miller, F. H.: "Partial Differential Equations," John Wiley & Sons, Inc., New York, 

1941. 
Petrovsky, I. G.: "Lectures on Partial Differential Equations," Interscience Pub- 

lishers, Inc., New York, 1954. 



CHAPTER 6 

ORTHOGONAL POLYNOMIALS, FOURIER SERIES, 
AND FOURIER INTEGRALS 



6.1. Orthogonality of Functions. A function f(x) defined over the 
interval (a, b) may be thought of as an infinite dimensional vector. 
f(xo) is the component of f(x) at x = x . Let g(x) be defined also over 
the interval (a, 6). In vector analysis one obtained the scalar, dot, or 
inner product of two vectors in a cartesian coordinate system by multi- 
plying corresponding components and summing. Obviously we cannot 
sumf(x)g(x) for all x, a ^ x ^ b. The next best thing is to define 

S(f,g) = f' f(x)g(x) dx (6.1) 

as the scalar product of / and g. If S(f, g) = 0, we say that / and g are 
orthogonal on the range (a, b). Generalizing, if 

r& 



/ 

Ja 



p(x)f(x)g(x} dx = (6.2) 



we say that / and g are orthogonal relative to the weight, or density, 
function, p(x), on the interval (a, b). 

6.2. Generating Orthogonal Polynomials. In Chap. 5 we noticed that 
important classes of polynomials such as the Legendre, Laguerre, and 
Hermite polynomials arose in connection with the solutions of various 
differential equations. In this section we shall show how to generate 
orthogonal polynomials. The various theorems proved here will apply 
to every type of polynomial generated. We proceed as follows: 

Let (a, b) be any closed interval, oo ^ a ^ x ^ b ^ >, and let 
p(x) be any real-valued function satisfying the following conditions: 

(i) P(X) ^ for a < x < b 

\ / f\ / 

(ii) [' p(x) dx > for a g a < ft g b 

Joe 

(iii) The moments of p(x), say, ^n, defined by 
x n p(x) dx n = 0, 1, 2, ... 



I b 

Ja 



exist and are finite for n = 0, 1, 2, .... We call p(x) a weight, or 
density, function. 

237 
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We now proceed to construct a sequence, or family, of polynomials, 

P O (X) s i, /MZ), />,(*), . . . , p n ( X ), . . . 

such that 

P(X) = Z" + (TnX"- 1 + ' (6.4) 

and such that 

f b P (x)P m (x}P n (x) dx = m 3* n (6.5) 

yo 

Condition (6.5) states that P m (x) and P n (x) are orthogonal to each other 
relative to the weight factor p(x). Note that P n (x) has its leading 
coefficient equal to unity. We apply mathematical induction to generate 
the family {P n (x)\. We have already defined Po(x) = 1. Let 

Pl(z) = X + ffi 

To fulfill condition (6.5), we need 

fp(x)(x + <n)dx = (6.6) 

The limits of integration are omitted with the understanding that they 
remain fixed throughout the discussion. Equation (6.6) yields 



[see (ii) and (iii) of (6.3)]. This choice of o-i yields PI(X) orthogonal to 
Po(x). Now assume that we have constructed the sequence of orthogonal 
polynomials P (x), PI(X), P 2 (x), . . . , Pk(x) satisfying (6.5). We know 
that k is at least 1 . We now show that we can extend our constructed 
set of polynomials to include the polynomial P k +\(x) while preserving 
the important orthogonality condition (6.5). Let 

k 
P k+1 (x) = x*+' + c r P r (x) (6.7) 



where c r , r = 0, 1, 2, . . . , fc, are fc + 1 undetermined constants. We 
desire 

Jp(aOP.(x)P*+i(x) dx = a = 0, 1, 2, . . . , k (6.8) 

With the aid of (6.7), (6.8) becomes 

I x k +*p(x) dx+^Cr f p(x)P,(x)P r (x) dx = Q (6.9) 

r0 

However, for s ^ r we have fp(x)P(x)P r (x) dx = provided 5 g fc, 
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r g fc, by our assumption on the set Po, PI, Pi, . , P*(#). Hence of 
necessity 

_ fs* +1 p(x) dx _ ( . 

C ~ ) dx - <U> 2, . . . , * (6.10) 



The numerator of c s exists [see (iii) of (6.3)], and the denominator of c 
is different from zero and is a linear combination of various moments of 
p(x), each of which exists. We leave this as an exercise for the reader. 
It is a trivial procedure to reverse the steps taken to derive the c a , s = 0, 
1, 2, . . . , fc; that is, if the c of (6.10) are substituted into (6.7), one 
easily shows that P k+i (x) is orthogonal to P r (#), r = 0, 1, 2, . . . , k, 
relative to p(x). The principle of finite mathematical induction states 
that a sequence of polynomials Po(#), Pi(aO, Pz(x) f . . . , Pn(%), - - - 
exists satisfying (6.5). 

Problem 1. Prove by mathematical induction that the sequence of polynomials 
\P n (x} \ is unique relative to the interval (a, b) and the density function p(x). 

For the interval (0, 1) and for p(x) 1 the orthogonal polynomials are 
the well-known Legendre polynomials. We generate now PI(#), P*(x). 
For PI(X) = x + v^ we have 

"""T"^" ""5 

Jo 



so that l\(x) = x - |. T.et l\(x) = x* + a^Pi + A P . From 

P Pi dx = 



we have / x 2 dx + do / dx so that ao = -J. From 



we have f ! (x ^)jc 2 dx + a\ I 1 (x ) 2 dx = 0, so that ai = 1. 
Thus P,(a:) = x 2 - (x - i) - i = a- 2 - x + I 



Problem 2. For the interval (0, o) and for p(x] == e~* generate Li(x) and 
These are the Laguerre polynomials. 

Problem 3. Construct the Hermite polynomials HI(X), Hi(x) for the interval 

(-00, oo), 



An interesting result concerning orthogonal polynomials is the follow- 
ing: Let Q(z) be any polynomial of degree n. There exists a unique set 
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of constants CD, Ci, . . . , c n such that 

n 
Q(X) = J C r P r (x) C n * 

The constants depend, of course, on the family of polynomials under 
discussion. The proof is by induction. If Q(x) is of degree zero, say 
Q(x) = a, then Q(x) = aPo(x). Assume the theorem true for all poly- 
nomials of degree g k. Now let Q(x) be any polynomial of degree 
k + 1, Q(x) = aox k+1 + aix k + + Ofc+i, a ^ 0. Then Q(x) - 
a<>Pk+i(x) is a polynomial of degree ^ k. By our assumption 

k 
Q(x) - a P k+ i(x) = 



00 



so that Q(x) = Y c r P T (x) c k +\ = 



Problem 4. Show that r ri r 0, 1, 2, 3, . . . , k + ], in the preceding line are 
unique. 

Problem 5. Show that $ P (x)x n P m (x) dx = for m > n. 

Problem 6. If Q(x) is a polynomial of degree m < n, show that 

{p(x)Q(x)Pn(x) dx = 

Problem 7. Show that the conditions 

fp(x)x k P n (x) dx - k - ; 1, 2, . . . , n - 1 (6.11) 

yield a unique polynomial P n (aO = x n -f o*,,^" 1 H- and that 

) rfx = 



for m 7* n if P m (x) is generated^ in the same manner. Equation (6.11) could have 
been taken as the starting point for developing orthogonal polynomials. 

Problem 8. For a 7* < , & F* +< find a linear transformation x = rx -\- s such 
that 5=0 when x = a, x = 1 when x = 6. For a? > ; 6= s -f> find a linear 
transformation which maps x = a into x = 0, x = + into f + oo . 

Problem 9. Show that JxP B _iP n p dx JP^p dx. 

6.3. Normalizing Factors. Examples of Orthogonal Polynomials. 

The constants 

7* JPJ()p(a:) dx n = 0, 1, 2, ... (6.12) 

are called the normalizing factors of the P n (x). It follows that 
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We define the orthonormal set of functions {<p n (x)\ by 

<p n (x) = \/pW Pn(x) n = 0, 1, 2, . . . (6.13) 

Tn 

The <p n (x) are not, in general, polynomials. They possess the attractive 
property 



^ 
dx 



if i 7* j 
. f ^. = . 



The {<p n (x)} form what is known as a normalized orthogonal system of 
functions associated with the family of orthogonal polynomials {P n (x)}. 
We now list a few families of orthogonal polynomials. 





Interval 


Weight function 


Polynomial 


1 


x ^ 1 


1 


Legendre 


2 


-1 ^ x 1 


(1 - z 2 )-* or (1 - z 2 )* 


Tchebysheff of 1st and 2d kinds 


3 


^ z ^ oo 


e~ x 


Laguerre 


4 


^ z ^ oo 


x c e~ x , c > 


Generalized Laguerre 


5 


oo ^ x ^ oo 


e -*V2 


Hermite 



Example 6.1. Let a = 0, & = 1 with p(ar) = 1. We choose 

P n (x) - 1 4- a,jr + a 2 x 2 + -f 
From / x k P n (x) dx = for k < n we have 

yo 



j- T =0 for k - 0, 1, 2, . . . , n - 1 



fc + 1 ^Ib +2 ^ r k 4 

We can solve these n linear equations for the a t , i = 1, 2, . . . , n, as follows: We 
have 



<"+...+ a 



k + 1 ^ k +2 



n -f 1 " (k + l)(fc +2) 



+ n + 1) 



frhere Q(k) is a polynomial in k of degree at most n. Since Q(fc) vanishes for A; 0, 1, 
2, . . . , n 1, of necessity 

- 2) - (k - n + 1) 



(see Lemma 1, Sec. 6.4). Thus 



fe + 



(k - n + 1) 



'k+n + l^(k+2)---(k+n + l) 



If we set k l, we obtain 



C(-D(-2) 
1-2-3 
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so that C - (-l) n . Thus 

t~\ l + ai 4. ... + a ' 4. ... 4. 
(0 * + l^fc + 2^ ^ A; + r + 1 ^ + fc + n + 1 

. (-l) n fe(fe - 1) (A; -n 
(* -h 1)(* -f 2) - . (fc+n 



To solve for o r , we multiply the above by k + r + 1 and then set A; = (r + 1). 
This yields 

(-l)(r + D(r+2) . . . ( r + n)(-l) 
(-r)(-r -f- l)(-r + 2) (-1)1 2 (n - r) 



- - 

r!r!(n - r)! 

n n 

Hence P n (x) - ^ W - ^ ( " 1)r (r) ( W ^ 

r0 r-0 

The Legendre polynomial with leading coefficient unity is 



U;- 

We now determine / F^(x)p(x) c/x. We have 
P n 2 (x) dx - I* a H x n f\(x) dx 

n 

a w / a r / J r '^ 
^w yo 



7=0 

Qr _ (~l) n n!n! 



n+r + 1 ~ n (2n + D! 

r0 

from (a) above. Thus 

2 ^ /"^ n!yt! . (2")! . 1 
n)\n) (2n -f 1)! (2n + 1)! 2n H- 1 

6.4. The Zeros of the Orthogonal Polynomials. We wish to show that 
the zeros of P n (#) of Sec. 6.2 are real, distinct, and lie in the fundamental 
interval. We first state and prove two lemmas. 

LEMMA 1. If x = r is a root of P(#) = 0, then x r is a factor of 
P(x)j P(x) a polynomial. The proof is as follows: We have upon division 

= Q(#) H -- J? = a = constant 
x T x r 

or P(x) - Q(x}(x -r) + R 

If P(r) = 0, then R = so that P(x) = Q(x)(x - r). Q.E.D. 
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LEMMA 2. If x = a + i b is a zero of a real polynomial, P(x), a, 6 
real, then a i b is also a zero of P(x). It is easy to see that 

P(a + ib) = U(a, b) + iV(a, b) 
P(a - ib) = U(a, b) - iV(a, b) 

Since P(a + ib) = 0, we have U = V = so that P(a - ib) = 0. 

We now prove the general theorem stated above. Let P n (x) be a real 
orthogonal polynomial. If x = a + ib, b j* 0, is a zero of P n (x), then 
[x - (a + ib)][x - (a - ib)] s (x - a) 2 + 6 2 is a factor of P n (x). This 
factor is positive for all x in our fundamental interval (a, b). This is true 
for all complex zeros of P n (x). Now let x = x\, x = x 2 , . . . , x = XN 
be the real zeros of P n (x) of odd multiplicity lying on the interval (a, b). 
Certainly N ^ n. Assume N < n y and let 

Q(x) = (x - xi)(x - x,) (x - Ztf)Pn(z) 

Let the reader deduce that Q(x) > for all x on (a, 6) except for x = xi, 
2, . . , ZArorQ(z) < for all x on (a, b) except for x = i, 2:2, . . . ,XN. 
We have, however, 



p(x)Q(x) dx = p(x)(x - xi) (x - a*)PCc) d^ = (6.14) 

Ja 

since (x x\)(x 0^2) * * (x XN) is polynomial of degree N < n 
(see Prob. 6, Sec. 6.2). But f b p(x)Q(x) dx > or f b p(x)Q(x) dx < 

ya J a 

from the nature of (?(#), a contradiction to (6.14). Hence N = w, and 
the theorem is proved. 

6.5. The Difference Equation for Orthogonal Polynomials. We obtain 
now an equation involving P n _i(#), P n (x), and P n +i(z). We have 

P n+ l(x) = X n+l + <T n +lX n + ' ' ' 

P n (x) = X w + (Tn^- 1 + 
SO that Pn+l(x) - xP n (x) = (<T n +l - ff n }x n + ' ' ' 

and Pn+i(z) ~ xPn(x) 



since x n = P n (x) (r n x n ~ l . Hence 

P n+ l(x) - xP n (x) - (*+, - (T n )Pn(x) = P n +l(x) - (x ' 

is a polynomial of degree at most n 1. From a previous theorem 

n-l 
P-M(Z) - (X + <r n +! - (T n )P n (z) = Y C r P r (z) 

r0 

n-2 

'n^! + Y C r Pr(x) (6.15) 



244 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

We multiply (6.15) by p(x)P n -i(x) and integrate over the interval (a, 6). 
Using the orthogonality property and the fact that 

fxP^l(x)P n (x)p(x) dx = SP*(X)P(X) dx = 7* 

(see Prob. 9, Sec. 6.2), we obtain y* = c n _i7_i- Thus 

n-2 

C r P r (x) (6.16) 
r-0 

is a polynomial of degree at most n 2. To show that c r = for 
r = 0, 1, 2, . . . , n 2, one need only multiply (6.16) by p(x)P 9 (x) 9 
s g n 2, and perform an integration over (a, fe). Hence 

P n+1 (x) - (x + er n+1 - <r n )P n (z) + -- Pn-i(x) B n ^ 1 (6.17) 

Tn 1 

Equation (6.17) is the difference equation satisfied by the orthogonal 
polynomials. Using (6.17), it is possible to show that the zeros of P0r), 
P n +i(x) interlace in the sense that, if x\ < x 2 < < x n are the zeros 
of P w (x) and if t/i < y* < < y n < y n +i are the zeros of P M +i(a;), 
then a < yi < xi < y<t < X* < < y n < x n < z/ n +i < b. We omit 
this proof. 

6.6. The Christoffel-Darboux Identity. From (6.17) we have 

P tt +!(0 - (t + <Tn+l - <Tn)Pn(t) + ^ P-l(0 = (6.18) 

Tn-1 

so that 
P n (OPn+i(z) -(x 

- (t 



l(0 - 

upon multiplying (6.17) by P n (t) and (6.18) by P n (a;). Subtracting and 
dividing by 7^ yields 



We have also 



.. nil (6>20) 

Tn 



Pi(x)Po(Q - Pi(QPo(g) = (s - <rj - - en) ^ (x - QPo(g)P,(Q 

To To * To 

(6.21) 
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If we let n = 1, 2, 3, . . . , k iu (6.20), add, and make use of (6.21), we 
obtain 

P k+l (x)P k (t) " P W (QP(X) _ V Pn(x)P n (t) , . 

( } 



-0 

n-0 

Equation (6.22) is the Christoffel-Darboux identity. We define the 
kernel, K k (x; t), by the equation 

VA p ( r \p (f\ 

Kk ( X ; - X r 'Wf' W (6-23) 

^ in 

n-0 

Problem 10. Evaluate Kk(x; x) by L'Hospital's rule. 

Problem 11. If x = c, x d are distinct zeros of Pjb(), show that K k (c', d) = 0. 
What if c = dl 

Problem 12. Prove that 

f* K k (x-, t) P (t) dt - 1 (6.24) 

6.7. Fourier Coefficients and Partial Sums. We may redefine the 
orthogonal polynomials so that 

p(x)pm(x)p n (x) dx = 6 mn * J if HI n 1 (6 ' 25) 

by simply letting p n (x) = (l/7n)Pn(x), n = 0, 1, 2, ... (see Sec. 6.3). 
Now let us consider a real- valued function of x whose Taylor-series expan- 
sion about x = exists. We have 



( 6 - 26 ) 



We see that f(x) can be expanded as a linear combination of terms from 
the sequence of polynomials 

1, x, x 2 , x 8 , . . . , x n , . . . 

f(x) = 



n-0 (6.27) 



It seems logical to ask whether a given function f(x) can be expanded in 
terms of the infinite sequence of polynomials 

po(aO, Pi(a0, p(*), . . . , Pn(x), . . . (6.28) 
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If this is the case, we may write 



/(*) = 

n-0 

To determine the coefficients c n , n = 0, 1, 2, . . . , we multiply (6.29) by 
p(x)p k (x) and integrate. This yields 



p(x)p k (x)f(x) dx = j a c n p(x)pk(x)p n (x) dx (6.30) 

n-0 

Assuming further that the series expansion converges uniformly on (a, 6) 
yields 

00 

p(x)p k (x)f(x) dx = y c n f p(x)p k (x)p n (x) dx = c k (6.31) 



n-0 



The c k , k == 0, 1, 2, . . . , of (6.31) are defined as the "Fourier" coeffi- 
cients of f(x). If I p(x)f*(x) dx exists, then from the Schwarz-Cauchy 
inequality for integrals we have 

|c*| 2 g I" P (x)f*(x) dx l b P (x)pl(x) dx (6.32) 

Ja J& 

Thus c*, given by 

c k = I b p(x)pk(x)f(x) dx k = 0, 1, 2, . . . (6.33) 



exists provided / p(x)f 2 (x) dx exists. Thus one can speak of the Fourier 

Ja 

coefficients of f(x) regardless of the existence or nonexistence of the expan- 
sion of f(x) in the form given by (6.29). 

If the c k , k = 0, 1, 2, . . . , n, of (6.33) exist, we can form the series 



s n (x) is called the nth partial sum of the Fourier series associated with 
f(x). With the aid of (6.23) and (6.33) we have 



(6.35) 
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The difference between f(x) and s n (x) is called the nth remainder in the 
Fourier series of /(x). We have 

.(*) - /(x) - (*) 

= f a b f(x)K n (x; t)p(t) dt - f' K n (x; t)f(t)p(t) dt 

= f' > K n (x;t)p(t)[f(x) -f(t)] dt (6.36) 

JO, 

since f b K n (x; t)p(t) dt = 1 [see (6.24)]. Now 

Jo. 



so that 



(^ (pn+l(x} Pn (0 ~ p n +l(t) Pn (x)] f(X} ~ f(t) P (t) dt (6.37) 

ja X C 



fn 

The reader should explain the apparent difficulty at t = x. If 

lim R n (x) = 

n > oo 

00 

the Fourier series, Y c n p n (^), converges to/(x). 

n-O 

6.8. Bessel's Inequality. We show first that R n (x) = f(x) s n (x) is 
orthogonal to Pk(x), fc = 0, 1, 2, . . . , n. We have 

f b [/(*) - s n (x)]p k (x)p(x) dx = [ b f(x)p k (x)p(x) dx 

Ja Ja 

n 

Ci t pt(x)p k (x)p(x) dx 



i-O 



t-0 

= c fc - c* = (6.38) 

We show next that 

n 

f'f*(x) P (x) dx fe J t| (6.39) 



t-o 



The proof is as follows: Clearly 



[/(x) - s(x)]V(x) dx * 

so that l*j*(x)p(x) dx-2 f(x)Sn(x)p(x) dx + f* sl(x)p(x) dx 

(6.40) 
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However, 

n 

f(x)s n (x)p(x) dx = c k f(x)p k (x)p(x) dx 



-I 

n 

and f s*(x)p(x) dx = Y Y c^c* f Pj(x)p k (x)p(x) dx 



fc 
fc^O 
n n 



so that (6.39) follows as a consequence. Since (6.39) holds for all n, the 

n 

sequence t n = Y ef, n = 0, 1, 2, . . . , is bounded. Moreover, this 
sequence is monotonic nondecreasing. This implies that lim t n exists. 

n > oo 

Let the reader deduce that 

80 

f*(x)p(x) dx^ cl (6.41) 

Formula (6.41) is BessePs inequality. 

Problem 13. Why is it true that lim c k = 0? 

Problem 14. If f(x) is a polynomial of degree n, show that 

dx - 

Problem 15. For (a, 0) such that a ^ a ^ /8 ^ 6 assume that 

f 

^n = / g(x)pn(x)p(x) dx 
Jot 

exists. Let f(x) = g(x) for a ^ x ^ /3, /(r) = otherwise. Show that lim d n 0. 



6.9. A Minimal Property of the Partial Fourier Sums. We look for a 
polynomial Q(x) of degree g n such that 

/= f [/(*) - Q(x)]*p(x)dx 
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shall be a minimum. Let s n (x) be the nth partial Fourier sum of f(x). 
Then 



= P [/(*) - (*) + (*) - Q(x)]*p(x) dx 

Ja 

[/(*) - s(z)] 2 p(*) dx + 2 [" 

ya 



The second integral vanishes from the results of Sec. 6.8 [see (6.38)] since 
s n (x) Q(x) is a polynomial of degree ^ n. The first integral has a 

fixed value. Hence / will be a minimum when / [s n (x) Q(x)]*p(x) dx 

Ja 

is a minimum. This obviously occurs if we choose Q(x) s s n (x). This 
is a highly important property of s n (x). 

Problem 16. Let Q n (x) be a polynomial of degree n with leading coefficient unity. 

/b 
Ql(x)p(x) dx is to be a minimum, show that Q(x) = P(x). 

Problem 17. Let {P n (x) \ be a family of polynomials with leading coefficients equal 

rb 

to unity, P n (x) of degree n such that / P*(x)p(x) dx is a minimum. Without using 

Ja 

the result of Prob. 16 show that 

f b P.(x)P,(x)p(x) dx - if i * j 

Ja 

Problem 18. Show that -yj +1 < 7* for the fundamental interval ( 1, 1). 
Pl+i(x)p(x) dx g xP;(a!)p(x) dx < Pl(x) P (x) dx 



6.10. Complete and Closed Sets of Orthogonal Polynomials. A set of 

orthogonal polynomials is said to be complete if for every continuous 
function f(x) we have 

lim I* R*(x)p(x) dx = (6.42) 

where R n (x) is defined by (6.36). From Sec. 6.8 we have 

n 

[ b Rl(x) P (x) dx = ff 2 (x) P (x) dx - y c| 

Ja Ja Lj 

so that (6.42) is equivalent to the statement 

00 

f b f(x) P (x) dx = y d (6.43) 

Ja Lj 

We shall prove in Sec. 6.11 that the orthogonal polynomials generated 
above are complete for any finite range (a, b). 



250 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

DEFINITION 6.1. A set of orthogonal polynomials is said to be closed if 
the vanishing of all the Fourier coefficients of a continuous function f(x) 
implies f(x) = 0. In other words, if f(x) is continuous and if 

I b f(x)p n (x)p(x) dx = n - 0, 1, 2, . . . (6.44) 

Jo. 

then f(x) s 0. 

Problem 19. Show that a complete set of orthogonal polynomials is a closed set. 

/b 
x n f(x)p(x) dx - forn 0, 1, 2, 
* 
. . , and conversely. 

6.11. Completeness of the Orthogonal Polynomials for a Finite Range. 

By a linear transformation the fundamental interval (a, 6) can be reduced 
to the interval l^zglifa and b are finite. The famous Weierstrass 
approximation theorem states that a continuous function on a closed 
interval can be approximated arbitrarily closely by a polynomial (see 
Courant-Hilbert, " Mathematische Physik," vol. 1). In our case we wish 
to approximate a continuous function f(x) on 1 g x g 1 to within 
VV^o, where * is an assigned positive number and MO is the first moment, 

/i = f P(X) dx. The Weierstrass approximation theorem states that 
there is a polynomial Q(x) such that 



!/(*) ~ Q(x)\ < -1 ^ x g 1 (6.45) 

We denote the degree of Q by m so that, from Sec. 6.9 for n ^ m, 



I 1 [f(x) - 
J-l 



dx ^ [/(*) - Q(x)]* P (x) dx 
-l 



and [f(x) - Sn(x)] 2 p(^) dx< P (x) dx - 



Expanding [/(x) s n (x)]*p(x) dx as in Sec. 6.8, we have 



Jb-O 



n 

But Y c| y c| < f or n sufficiently large so that 



t-o 



t-o 
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00 

Since / f*(x)p(x) dx and Y cl are fixed constants whose difference can 

J ~ l o 

be made as small as we please, we must have 

00 

f^f*(x)p(x)dx= 1 cl 

* = 

This completes the proof [see (6.43)]. As an immediate corollary it 
follows that for f(x) continuous the relations [ x n f(x)p(x) dx = 0, 
n = 0, 1, 2, . . . , imply f(x) = 0. 

6.12. The Local Character of Convergence of the Fourier Series off(x). 
We are now in a position to examine the Fourier remainder R n (x) [see 
(6.36)] for the range 1 g x ^ 1. Let us assume that the p n (x) are 
uniformly bounded on this range. This means that |p n (z)| < M < <x>, 
1 g x g 1, for all n, M a constant. Let 6 be any small positive num- 
ber. Then for - 1 g x* 1 



f /"so- 3 [xo+8 fl I 

\ Mn(, & + / *>(*, Odt+ / /info * 

U-1 7xo-5 yxo-h J 

(6.46) 



where 

XQ 

[see (6.37)]. For the intervals - 1 g g z - 5, x + 5 g ^g 1 the 



function o " is well behaved. Hence 

z 2 

lim r*~\ n (x,()dt = 

n > oo y ~ 1 

lim f l p n (x, t) dt = 

n _ y*o+ 

(see Prob. 15, Sec. 6.8). Hence 
lim # n (zo) 

n > 

I! 



We need not worry about lim y n +i/y n since < y n +i/Vn < 1 (see Prob. 

n * 

18, Sec. 6.9). Whether lim R n (x<>) is zero or not depends entirely on 

n-n 

the behavior of f(t) in the neighborhood of t = x . If, for example, 
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A = constant, for all t near XQ, then 

\R n (xo)\ $ 6M 2 A5 M o 

for n sufficiently large. Since 6 can be chosen arbitrarily small, it follows 
that lim R n (xo) = 0. 

n 

For a complete discussion of orthogonal polynomials the reader is 
urged to read G. Szego, "Orthogonal Polynomials," American Mathe- 
matical Society Colloquium Publications, vol. 23. 

6.13. Fourier Series of Trigonometric Functions. In the early part of 
the nineteenth century the French mathematician Fourier made a 
tremendous contribution to the field of mathematical physics. A study 
of heat motion (see Example 22, Chap. 5) led Fourier to the idea of 
expanding a real valued function f(x) in a trigonometric series. 

f(x) = ao + fli cos x + a 2 cos 2x + 
+ 61 sin x + bi sin 2x + - 

to oo 

= is-ao + X a n cos nx + Y 6 n sin nx (6.47) 

n-l n-l 

At the moment let us not concern ourselves with the validity of (6.47). 
We wish to determine a, n = 0, 1, 2, . . . , and 6 n , n = 1, 2, . . . , if 
(6.47) were valid. From trigonometry we have 

sin nx sin mx = [cos (n m)x cos (n + m)x] 
sin nx cos mx = ^[sin (n m)x + sin (n + m)x] 
cos nx cos mx = ^[cos (n m)x + cos (n + m)x] 
so that 

if n 7* m 



sin nx sin mx dx .- 

-* TT if n = m 



/: 
/; 



sin nx cos mx dx = 

r 

7 if n 5^ m 
cos nx cos mx ax = .- 

T ii n == m 



(6.48) 



provided m and n are integers. 

If we multiply (6.47) by cos mx and integrate term by term (we are not 
concerned at present with the validity of term-by-term integration), 
we obtain by use of (6.48) 



cos mxf(x) dx m = 0, 1, 2, ... 
Similarly (6.49) 

i r* 

b m = - / sin mxf(x) dx m = 1, 2, 3, . . . 
T y ~T 
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The a's and &'s of (6.49) are called the Fourier " coefficients" of f(x). 
These coefficients can exist [if the integrals of (6.49X.exist] regardless of the 
possibility of expanding f(x) in the series given by (6.47). We can 
replace the interval ( TT, v) by the interval (0, 2x) if we so desire. The 
results of (6.48) and (6.49) hold for the interval (0, 27r). 

Example 6.2. We shall see later that f(x) x t for v ^ x ^ ir, has a Fourier- 
series development. We calculate the Fourier coefficients given by (6.49). 

1 [ * , 1 (x sin rax * If*. , \ 

a m - - / x cos mx dx * - I I sm mx dx ) 

irJ-T ir \ m -* my-r / 

5 [os mir cos ( mir)] =0 m 5^ 

TT m 2 



a 



xdx - 



-;/.: 



x sin mx = cos 



2 

m 
2 
m 



for m even 
for m odd 



Hence 



f(x) - 2(sin x - | sin 2x + 



sin 



- sin 4x + ) 



(6.50) 



Since the right-hand side of (6.50) is periodic, its graph is given by Fig. 6.1. We note 




that /(ir/2) x/2 - 2(1 
from 



O 



FIG. 6.1 

| + J - y + ). This checks the result obtained 



provided the series can be integrated term by term. 
Example 6.3. The function f(x) given by 

f(x) - -TT x 
/Or) -a: ^ or ^ T 



will be seen to have a Fourier-series expansion. The Fourier coefficientl of f(x) i 
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fln - i I - cos nx dx + - f * x cos nx dx - -i- [(~l) n - 1] 
iry-ir T./0 7&V 

fen - /. a; sin nx dx 

a; d# = 



a = 



Thus 



cos 



4 Z/ L ^ w2 

00 00 

cos (2n l)x _ V 



(-!)" sin 



4 r ^ (2n - I) 2 

H = l 



For x the right-hand side of (6.51) becomes 



n-1 



ao 

V 1 ?T 2 

Is / 7?: TT " "S"? I n complex-variable theory it can be shown that 

/.^ (^7i I) 4 O 



n-l 



(n 



(6.51) 



00 

VI TT 2 

If we allow z to tend to zero, we obtain ) -^ r~rro ~5~> ^ ne desired result. The 

l^ (Zn -r I)* o 

71=0 

limit process can be justified. 

For x ?r and x * TT the right-hand side of (6.51) becomes 



4 T Z/ (2n - I) 2 4 ' T ^ (2w - I) 2 

n - 1 = 1 

7T , 27T 2 T 



Notice that^ ( r ) "^/W . ?L^Z . The right-hand side of (6.51) when evalu- 
Z JL 

ated at either end point, "# = T, yields the mean of f(x) at these end points. The 
graph of the right-hand side of (6.51) is given by Fig. 6.2. 




FIG. 6.2 



2r 3r a; 
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Example 6.4. Let }(z) be analytic for \z\ > 0. Then 



/(*) - a n z n 

(6.52) 



where C can be chosen as the circle \z\ ** 1. On C we have f = e% d ie** d<p, so 
that 

00 

/(*) - y a n z 



a " 27rjo J 



For 2 on the unit circle |z| = 1 we have z = c* and 



(6.53) 



If we define g'(^) =s f(e t9 ), (6.53) becomes 



(6.54) 
1 r 2* 
2ir Jo 

Example 6.5. We can employ the results of Sec. 5.7 to evaluate the Fourier coeffi- 
cients of f(x). A particular solution of 

y" 4- n*y - /() (6.55) 

is given by 

y(x) - f*f(t)g(x - di 
where 0(x) is a solution of y" -f n 2 y 0, 0(0) 0, 0'(0) = 1. Thus 

sin n(x - t) dt (6.56) 

is a solution of (6.55) with y(Q) - 0, y'(Q) * 0. Evaluating (6.56) at x * 2ir yields 



so that 6. - /(0 sin nt dt ~ - $|57) 

An analogue computer can be used to solve (6.55) subject to the initial 
y(0) - 0, y'(0) - 0. 
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A graphical solution can be obtained for y(x). The height of the ordinate &tx = 2ir 
yields y(2ir), which in turn yields b n of (6.57). Differentiating (6.56) and evaluating 
at x 2ir yields 

f(t) cos nt dt = ~ y'(2ir) (6.58) 

Problems 

Find the Fourier series for the following functions defined in the interval TT < 
x <ir: 

1. f( x ) = for -TT < x ^ 0, f(x) - 1 for < x < ir 

2. f(x) = |s for --n- < x < ir 

3. f(x) = e* for -TT < z < ir 

4. /(x) = cos 2 x for TT < x < ir 

5. /(:r) cos a for |a:| ^ a 5^ 0, /(x) = cos x otherwise 

6. Let f(x) be an even function, that is, f(x) = f(x). Show that 

b n ~ - I f(x) sm nx dx = 
ir J TT 

7. Let/Or) be an odd function, that is, f(x) = f(x). Show that 

1 /"* 
a n = - / /(x) cos nx dx = 

7T J -TT 

8. Show that any function can be written as the sum of an odd function and an even 
function. 

9. Find the Fourier series of /(x) = |sin x\, IT ^ x ^ TT. 

6.14. Convergence of the Trigonometric Fourier Series. We wish now 
to investigate the convergence to f(x) of the Fourier series 



l 

cos ?ii d< n = 0, 1, 2, ... 



i^o + y a n cos nx + V b n sin nx (6.59) 

1 f w 

a n = - / 

* J -ir 

1 /"* - 
b n = _ / /() sin 

7T J -TT 

Some preliminary discussions are necessary. First let us consider 

lim r F(t) sinktdt (6.60) 

~ 



Intuitively we feel that for very large k the function F(x) sin kx will be 
positive about as often as it is negative (with the same absolute value) 
since sin kx will oscillate very rapidly for large fc. Since the integral 

/F(f) sin kt dt represents an area, we shall not be surprised if 
~* ,* 

lim f* F(t) sin ktdt = (6.61) 

& ^ """" 
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Certainly much will depend on the nature of F(x). It is easy to see that 
(6.61) holds true if F(x) has a continuous first derivative for w ^ x ^ TT. 
Integration by parts yields 

f* EI/ \ 7 j F(x) cos kx * . I f* . . , 
/ F(x) sin kx dx = - ^~ - + y / F'(z) cos fcx dx 

J -IT # T /C y _- 

From the fact that jF'(z) is bounded on the closed interval TT^X^TT 
it follows immediately that (6.61) holds true. 

Actually we can weaken or make less stringent the conditions on F(x) 
in order that (6.61) hold true. Nothing need be said about F'(x). Let 
F(x) and [F(x)] 2 be integrable on the interval TT ^ x ^ TT. We now 
define s n (x) by 



fc cos fcx + 6 A sin fco: (6.62) 

A-l 

1 /"* ' 
a k = - I F(t) cos kt dt 

* J -* 
1 /"* 

b k = - F(t) sin fc< d 
^" J ~T 

s n (x) is called the nth partial Fourier sum of F(x). Note that s n (x) is a 
finite trigonometric series. Moreover s n (x) is continuous. From 



(F(x) - (*)] = [*(*)]* - 2F(x) Sn (x) + 
we have 



s n )P<fc = /* [F)] J ctt - 2 

y-ir 

One can easily show (see Sec. 6.8) that 



n 

r F(o.(o ^ = ^ + , y (oj 

; " 



+ 



+ T -(aj + 



From the fact that f " [F(t) - s n (t)] 2 dt ^ we deduce that 

J ~ 7T 
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The left-hand side is monotonic increasing with n and is bounded by 

1 /"* 
the constant- / [F(i)]*dt. Hence 

*" J -r 



This is Bessel's inequality [see (6.41)]. Since the series of (6.64) con- 
verges, of necessity, 

lim a* = lim 6* = 

fc-oo fc-* 

lim bk = is precisely the statement (6.61). 
*-*<> 

A simple class of functions which are both integrable and integrable 
square is the set of functions which have a finite number of bounded dis- 
continuities in the sense that if a: = c is a point of discontinuity of such a 
function then lim /(a;) and lim /(a;) both exist but are not equal. We 

JB-+C X C 

X<C X>C 

write 

lim /(*) = /(c-0) 

x e 

lim /Or) =/(c + 0) 

X *C 

x>c 

A function which has a finite number of bounded discontinuities of the 
type described above is said to be sectionally, or piecewise, continuous. 
Now let/(z) be sectionally continuous for TT ^ x ^ ?r, and assume that 
f(x) has the period 2v, that is, f(x + 2?r) = f(x). We make use of the 
periodicity of f(x) in (6.67) below. It is not necessary, however, that 
/(IT) = /( TT), since the value of f(x) at one point does not affect the value 
of the integral [see (6.67)]. In most cases /(x) is defined only for the 
interval IT ^ x ^ IT. We*easily make f(x) periodic by defining f(x) 
at other values of x by the condition f(x + 2ir) = f(x). 

The nth Fourier partial sum of f(x) can be written as 



-^ / 



s n (x) = - f(t) dt + f(t) cos kt cos kx dt 



sin 



j /_'/(0 [j + ^ cos Jb - x)] < 
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It is important to note that s n (x) is a finite trigonometric series and that 
s n (x) is continuous. As long as the Fourier coefficients exist, it is always 
possible to construct s n (x). Thus s n (x) exists independent of the possible 
development of f(x) as a Fourier series. 

We desire to show that, under suitable restrictions concerning the first 
derivative of /(#), 

lim 8n(x) = i[/(s + 0) + f(x - 0)] (6.65) 

n 

From 

e wci-) - cos k(t _ x ) -j- i s i n fry _ x ) 

x) - cos m __ x ) _ i s i n ^(1 _ x ) 



the reader can show that 





if /(x + .) *LfrL+ 

IT J- r 2 sm (w/2) 



(6.67) 

v ' 

The last integral results from the fact that the integrand has period 2*-. 
From (6.66) the reader can deduce that 



: du 

v j _ T z sm (u/z) 

1 / T Gin (in JL 

so that / (x) = 



2 
and .(x) - /(*) = _ [/(* + ) - /(x)] 



1 /"* /(x + w) - /(x) M / 

- / - ! - - - -^ 7T~- - 7/n\ sm ( n 

v J - r u v 



7T~- - 7/n\ 

2 sin (w/2) 
Now consider x fixed, and define 



~ M 2 sin (w/2) 

will be sectionally continuous if /'(x) exists, for in this case 
lim F(u) = /'(x) 



Hence lim f * F(u) sin (n + ?)u du = (see Prob. 2 of this section). 
-* /- 
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We have shown that, if f(x) is sectionally continuous for TT g x g ?r and 
if /(x) has a period 2w, then, at any point x such that /'(x) exists, the 
Fourier series of }(x) given by (6.59) converges to/(x). 
It may be that/'(x) does not exist at the point x but that 

lim /(* + u) - f( X + 0) = f(f + Q) 

u-0 u 

; i l/(.+u)-/(x-o) = 

u-,0 W 

W<0 

do exist separately. We call f'(x + 0) and f'(x 0) the right- and left- 
hand derivatives of /(x), respectively. Now let the reader show that 



i 

T Jo 



f(x + u)~ f(x + 0) u , 

o ! 7 7o\ Sln ( n i T/ w 

o w 2 sin (u/2) 

/( + )-/(-0) u s . n(n 



- - - - - 7T-' T~AV; 

TT j-^ w 2 sm (w/2) 

It follows that if f'(x + 0) and/'(z - 0) exist then 

lim 8n (x) = Hf(x + 0) + f(x - 0)] (6.68) 

n 

We formulate the above results in the following statement: 
THEOREM 6.1. Let f(x) be sectionally continuous for TT g x g ?r, 
and let /(x) be periodic with period 2w. At any point x such that /(x) 
has both a right- and left-hand derivative the Fourier series of /(x) will 
converge to the mean value of /(x) at x defined by |[/(x + 0) + /(x 0)]. 
It should be emphasized that a sufficient condition for /(x) to be 
written as a Fourier series has been given. There are no known necessary 
and sufficient conditions for /(x) to be developed in terms of a Fourier 
series. Study of the Lebesgue integral yields greater insight into the 
development of Fourier series. References to this line of study are given 
at the end of this chapter. 

. Problems 
1. From the identity 

2 sin ^ cos ku sin (k -f ?)u sin (k i)w 
z 

deduce that 

n 

V , 1 , sin (n -f i)u 

/ cos ku - - - + n . . * 
Lj 2 2 sin (w/2) 

fc-i 

Hint: Let k 1, 2, . . . , n, and add. 
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2. Under the restriction that F(x) and [F(x)]* be integrable we have shown that 

lim / F(t) sin nt - 
n-*oo /- 

lim I* F(0 cosn* - 
n _>,c ./-* 

For the same F(x) why is it true that lim / F(t) cos (t/2) sin nl d* 0? Show that 

n _>oo y-T 

lim f* F(0 sin (n + ?)t dt - 0. 

n _eo ./ - 



3. Let f(x) have a continuous derivative for v ^ x ir, so that |/'(x)| < Af for 
IT ^ x ^ w. Show that the Fourier coefficients of /(x) satisfy 

|o*| < ?M * - 1, 2, 3, . . . 

What further restriction can be imposed upon f(x) so that |6*| < 2M/kt 

4. Let/(x) and [/(x)] 2 be integrable for the range TT ^ x ^ T. Consider the finite 
trigonometric series 

n n 

S n (x) = -|ao -f ^ dk cos fcx + 2^ 6* sin kx 

k=l k=l 

Show that J I [/(x) S n (x)] 2 rfx is a minimum for 
j -if 

<*fc = ~ / ^ /(^) cos fcx rfa: A: = 0, 1, 2, ... 

7T J -7T 

b* - - /"* /() sin kx dx k = 1, 2, 3, . . . 

TT y -TT 

6. Let/(x) * sin (1/x), x ^ 0, /(O) = 1. Is/(x) sectionally continuous? 

6. If /'(x) is sectionally continuous for TT ^ x ^ IT, show that /(x) is continuous 
for TT < x < TT. 

7. If /(x) is a continuous function such that /(x + T/3) = /(x), show that its 
Fourier series has the form 

00 00 

-jr + y a n cos 6nx -f y b n sin 6nx 

nl n=l 

8. Let (a n , b n ) be the Fourier coefficients of /(x), ja n , 0n! the Fourier coefficients of 
41 0(x). Find the Fourier coefficients of 



MX) - /(^ -t)g(t)dt 

9. Let/(x) be continuous on the closed interval a ^ x ^ 6. Subdivide this interval 
into a = x , x\, Xa, . . . , x n = 6. Now write 

n 

// (x) sin ax dx / / /(x) sin ax dx 
i ^W Jxi-i 

i-i 
n n 

" / / ** [/(*) "" /fo-0] sin ax dx + Y / *' /(x t .i) sin ax dx 

Lt Jxi-i L~t Jxt-i 



t-1 t-1 
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From the fact that f(x) is uniformly continuous deduce that 

/6 
f(x) sin ax dx ~ 
.. . ~ l 

Extend this result for f(x) sectionally continuous on a x 6. 

6.16. Differentiation and Integration of Trigonometric Fourier Series. 

We have seen that under suitable restrictions the trigonometric Fourier 
series of /(x) will converge to /(#). Now we are concerned with term-by- 
term differentiation of a Fourier series. Let us return to the Taylor- 

00 

series expansion of f(z). If > a n z n converges to/(z) for ^ \z\ < r. we 

LJ 
n-0 

00 

know that V a n nz n ~ l will converge to/' (z) for ^ \z\ < r. In the case 

n = l 

of Fourier series, however, a simple example illustrates that the new 
Fourier series obtained by term-by-term differentiation may not converge 
to/'(x) even though f'(x) exists. The Fourier series of /(x) = x, IT < 
x ^ ?r, is given by 

2(sin x ^ sin 2x + i sin 3x -j sin 4x + ) 
Term-by-term differentiation yields 

2(cos x cos 2x + cos 3x cos 4x + ) 

This series cannot converge since cos nx does not tend to zero as n 
becomes infinite. 

Certainly term-by-term differentiation of a Fourier series leads to a 
new Fourier series. If this new series converges to/'(x), we shall wish to 
make certain that /'(a;) has a convergent Fourier series. From Sec. 6.14 
we have seen that a sufficient condition for this is that/'(x) be sectionally 
continuous and that /'(x) have periodicity 2ir. It is not necessary that 
/'( ?r) =/'(TT). Let the reader show that, if f(x) is sectionally con- 
tinuous, then f(x) is continuous. Furthermore if /'(x) has periodicity 
2?r, the reader can show that/( TT) = /(TT) is sufficient to guarantee that 
/(x) has periodicity 2?r. We state and prove now the following theorem : 

THEOREM 6.2. Let f(x) be continuous in the interval TT g x ^ TT, 
/(ir) = /(TT), and let/'(x) be sectionally continuous with periodicity 2ir. 
At each point x for which f"(x) exists the Fourier series of /(x) can be 
differentiated term by term, and the resulting Fourier series will converge 
tof(x). 

The proof is as follows: From Sec. 6.14, f'(x) can be written 

00 00 

/'(x) = aj + Y a' n cos nx + Y b' n sin nx 

nl n-l 
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1 f' 

with a' n = - I /'(x) cos nxdx n = 0, 1,2,.. 

fl" J IT 
1 /"* 

&n = - / /'(*) s in nx dx n = 1, 2, 3, . . . 

IT./-* 

Integration by parts yields 

a' n = - /(x) cos nx H / /(x) sin nx dx 

~ | * J -v 



7T 

f 'IT 

sin nx rfx = nb n 



= - f f(x) sir 

T J -7T 



since /( TT) = /(?r) by assumption. b n is the nth Fourier sine coefficient 
of/(x). Similarly 

b' n = -na n 

oo oo 

so that /'(x) = 2, ~~ na n sin nx + Y nfr n cos nx (6.69) 

n-l nl 

The Fourier series of (6.69), however, is the Fourier series obtained by 
term-by-term differentiation of 



00 00 

/(x) = -^a + X fln cos nx "^ L, 



nx 



This proves the theorem stated above. 

We turn now to the question of term-by-term integration of a Fourier 
series. The reader knows from real-variable theory that integration 
tends to smooth out discontinuities, whereas differentiation tends to 
introduce discontinuities. As an example, consider the function /(x) 
defined as follows : 

/(x) =0 for-oo<z^0 
/(x) = x for ^ x g 1 
/(x) =2 for 1 < x < oo 

f(x) does not exist at x = and at x = 1. Moreover /(x) is discon- 
tinuous at x = 1. However, the Riemann integral of /(x) defined by 



- is continuous for oo < x < . At any point x y 1 [x = 1 is the only 
discontinuity of /(x)] we have F'(x) = /(x). Thus F'(Q) exists, whereas 
/'(O) does not exist. Let the reader show that F'(l) does not exist. 
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We find that very little is required to integrate a Fourier series term by 
term. We state and prove the following theorem: 

THEOREM 6.3. Let /(x) be sectionally continuous for ?r g x g TT. 



If s-a + Y a n cos nx + 2& n sin nx is the Fourier series of /(x), then 



n-l n-l 



oo oo 

/ /(x) dx = ~ (x + TT) + y sin nx y (cos nx cos /ITT) 

nl nl 

(6.70) 

Equation (6.70) holds whether the Fourier series corresponding to /(x) 
converges or not. Equation (6.70) is not a Fourier series unless a = 0. 
The proof is as follows: Since /(x) is sectionally continuous and hence 
Riemann-integrable, we note that F(x) defined by 

F(x) = /Cc) dx - |aox (6.71) 



is continuous. Moreover F'(x) = /(x) ^ao except at points of dis- 
continuity of /(x). If x = c is a point of discontinuity of /(x), the reader 
can easily verify that F'(c - 0) and F'(c + 0) exist. From (6.71) it 
follows that F(TT) = F(w) = ^aw. F(x) can be made periodic by 
defining F(x) for |x| > TT by the equation F(x + 2w) = F(x). From 
Sec. 6.14, F(x) has a Fourier-series development 



0) + F(x - 0)] - ^A Q + A n cos nx + # sin nx 



or F(x) - $A +, A n cos nx + J5 n sin nx (6.72) 



since F(x + 0) = F(x 0) from the continuity of F(x). Integration by 
parts yields 



"(x) sin nx 



A n = - I F(x) cos nx dx 



sm nx 



1 f* 

- / F- 

mr J _ T 

1 /"* 

/ [j"(^) ~ ^ ao ] s i n TXJJ dx 

WK J v 

, _1 [' f(x) 3 mnxdx= -i 
nwj.. r n 
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Similarly B n = (l/n)a w . We now write 



00 00 

(%) = i^o / - 6 cos nx + / - a n si 
L~i n L-4 n 



sn nx 



00 

From F(ir) = ^a G ir we deduce that AQ = a 7r + / - 6 n cos TITT. Using 

^V ^ 

n = l 

these results along with (6.71) yields (6.70). Q.E.D. 

Problems 

1. Differentiate and integrate the Fourier series for those functions of Probs. 1, 2, 
3, 4, 5, 9, Sec. 6.13, which admit these processes 

2. Sum the series 

1 -I +1_I+ . +_T 1 11_ + 

33 -T 5 3 73 1- t- (2/t + j), f ' ' ' 

3. It can be shown that if f(x) is continuous for TT ^ x ^ TT, f(ir) = /( TT), and if 
/' (x) is sectionally continuous for TT ^ x ^ TT, then the Fourier series of f(x) con- 
verges uniformly and absolutely for TT ^ x ^ TT. Multiply 

oo oo 

fW i a o + / an cos rue -f- y 6 n sin nx 

n = 1 n = 1 

by /(^)> integrate term by term (this is permissible since the new series converges 
uniformly), and show that 

dx = 



This identity is due to Parseval. 

6.16. The Fourier Integral. If f(x) is defined for x on the range 
oo < x < oo and if f(x) is not periodic, it is obvious that we cannot 
represent f(x) by a trigonometric Fourier series since such series, of 
necessity, must be periodic. We shall show, however, that under certain 
conditions it will be possible to obtain a Fourier integral representation 
of/(x). 

THEOREM 6.4. Let f(x) be sectionally continuous in every finite 
interval (a, fr), and let/(#) be absolutely integrable for the infinite range 



oo < x < oo, that is, / |/0*OI dx = A < oo. At every point x, 

J ~oo 

oo < x < oo , such that f(x) has a right- and left-hand derivative, f(x) 
is represented by its Fourier integral as follows: 



\\f(x + 0) + f(x - 0)] = - da f(t) cos a(t - x) dt (6.73) 



266 
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We proceed to the proof in the following manner: The result of Prob. 9, 
Sec. 6.14, as well as the methods of that section, enables us to state that 



(x - ())] = lim - I 

a-* oo 7T J a 



dt (6.74) 



i[/(x + 0) + /( 

at any point x for which f(x) has a right- and left-hand derivative, 
a < x < b. Now we investigate the existence of 



/_/<'> ^T^hr 

For a < x we observe that 

sin a(t 



(6.75) 



dt < 



\f(t)\dt 



The convergence of / |/(OI dt shows that for any e/2 > an A exists 

7oo 

such that f or a ^ A 



f(t) sin a(t - x) 

, t x 



dl 



<5 



independent of a. Similarly one shows that for any e/2 > a number B 
exists such that f or b ^ B 



f f(i) sin a( 

A rr 



independent of a. Thus the right-hand side of (6.74j can be made to 
differ from 



lim - 

a > oo 7T 



dt 



by any arbitrary e > provided a and b are chosen sufficiently large. 
This implies immediately that 

*[/(* + 0) + f(x - 0)] = Urn - /"" f(t) sm <*(* ~ *) dt (6J6) 

ax 7T J oo I X 

Now / cos p(t x) dp = ^ 9 so that (6.76) may be written 

Jo t - x 

1 /"" f a 

i[/( + 0) + f(x - 0)] = lim - / f(t) dt / cos /*(* - x) dp (6.77) 

-> TT y _, yo 

Before we pass to the limit as a ? we wonder if we can interchange 
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the order of the double integration. Let us first consider 

f(t) cos n(t x) dt 

x a parameter and ^ M ^ a, a fixed. We show that G Y (M) is continuous. 
For fj. = MO we have 



(t - x) - cos M o(^ - r)] * 

/y 
/(J)lcos M( x) cos MO( x)] dt 

+ I f(t)[co8 n(t x) cos no(t a;)] dt 

ip 

|/00 1 dt, and the last integral is less 

than 2 j \f(t)\ dt. Since we are given that I |/(/)| dt exists, a T can 

be found such that these integrals can each be made less than ^e for any 
> 0. T is obtained after e is chosen. From 

cos n(t x) cos /IQ( x) = (fj, /xo) sin ji(t x) 
p. between /i and M, we note that the middle integral is less than 



Since f \f(t)\ dt = A is a finite number, we can find a d > so that the 

middle integral is also less than e/3 for \p po\ < 6 < e/3A. Hence for 
any c > there exists a 5 > such that 

|G(M) -GG*o)| < e 

for \fjL /x | < 5. This is the definition of continuity, so that G(fj) is 
continuous at /x = JJLQ and, in fact, for all p. Hence 

H(a) = I dp I f(t) cos p(t x) dt 



dH /** 
exists, and -7 = / f(t) cos n(t x) dt. By considering 

ttOf J 00 

/(<) cos /.( - x) dn 
let the reader show that -r- = -3 Since X(0) = H(0), of necessity 



COS M ~ 
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Allowing a to tend to infinity enables us to rewrite (6.77) in the form given 
by (6.73). 

If f(x) is continuous at x, (6.73) may be written 

. f( x ) = - / da I f(t) cos a(t - x) dt (6.78) 

* JO J- 

From 

cos a(t - x) = Ke^'-o + e-*<*-'>) 

one readily obtains 

}(x) = ^ ^ da j^ /(Oe*<-'> dt 

= _L / e da f(t)e-*< dt (6.79) 

ZTT j _ oo 7 - 

Equation (6.79) can be written in the symmetric form 

1 



* = "75= 

v 

g(a} = 



We say that g(a) is the Fourier transform of the real function f(x), and 
f(x) is called the inverse Fourier transform of g(a). 

A more rigorous treatment of the Fourier integral requires the use of 
the Lebesgue integral. 

Example 6.6. We consider the function f(x) defined as follows: f(x) = for x < 

and/(x) = e~ 0x for x ^ 0, ft > 0. Fourier's integral certainly applies to this function, 

/ 
|/(x)| dx exists. From (6.79) 

g(a) = -4= I -"-* dt = -4= TTT^- 



- i r 

7T JO 



ft cos ax -f Q sin ax 
' cos ax + sin ax . 



From the definition of /(x) one has for x > 

I aX 4- a sin ax 



o 1 / * 

^ ~ L 

TT ;o 



1 [ COS ax a sin ax , 

_ . _ a 
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JL -Bx _ f cos ax 

20 "Jo p* -f-* 2 ( 



,1 j. v a ( cos aa; j 

sothat s^-yD FT** 1 , 681) 

-asinax, (6 ' 81) 



T /" " 

2 e ~** = Jo 

Let the reader obtain (6.81) directly by contour integration. 

Example 6.7. Suppose /(x) is an even function. Equation (6.78) can be written 

o /* oo r oo 

/(x) - I da I f(t) COS COS aX eft 

O /* oo / oo 

= - / cos ax dx \ f(t) cos at dt (6.82) 

TT ;o Jo 

If we wish to solve the integral equation 

e~ = f* cos a^/(0 rf/ 

for /(x), we apply (6.82) and obtain 

2 /" 

/(x) = / e~ a cos x rfj 

7T JO 



TT! + X 2 

Notice that /(x) is an even function and that this function has the necessary properties 
for the application of the Fourier integral formula. 

Example 6.8. Let p = -^ and assume Z(p) such that Z(p)e lfjlt = Z(iu>)e^ t . A 

d 2 
simple example is Z(p) = aop n -f a\p n ~ l ++ a n -\p + a n , where p 2 = -,--> etc. 

We consider the equation 

Z(p)0 (t) = 0(0 (6.83) 

We assume that the input O t (0 is given, and we desire to find the output (0 such 
that (6.83) will hold. Let tf(w) and H (w) be the Fourier transforms of t (0 and 
(0, respectively. Then 



i = , 

V 27T / - 

! r (6.84) 

ff () = ~7^: / 6-*'e (0 d 

V 2r J - 



VJ 
i r 

and Q,(Z) /= / e li "H t (u) du 

eo(0 = ^/-. e " //oMd ' 

Substituting into (6.83) yields 

Z(p) )_ elt H *^ dw * ]_ tto ^.() d" 
Assuming that ^(p) may be placed inside the integral yields 

Z(iu)e ttu> HoM du ** I e ttu H l M d<* 



270 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

Since the two integrals are equal for the full range of t, it appears logical that 



Hence 0.(l) - 



Z(p) is called the operational impedance, and its reciprocal, Y(p), is called the 
transfer function. We now have 



8 fl (0 -4^ f c"jSr,( 

V 2r J-" 



- 

V 2r 
Making use of (6.84) yields 



^ 1 Too Too Bi(T)F(?w)6UU , (t _ r)drrfco 

JTT y - * y - oc 
= f * e,(r)j/(/ -r)</r (6.85) 

y-oo 
where y(t - r) - - f F^)^'-^ dco 

ZTT y oo 

We have assumed that the order of integration could be interchanged. Equation 
(6.85) can be written 



9 (0 - / 9.0 ~ r)y(r) dr (6.86) 

y(r) is appropriately called the memory function because of the term t (< T). Let 
the reader compare (6.86) with the particular solution of a linear differential equation 
with constant coefficients given by (5.71). 

No attempt at rigor has been made in obtaining the results of this example. 

Problems 

1. Suppose that f(x) of (6.74) is an even function. Show that 

?[f(x -f 0) -f f(x - 0)] - r I da f f(t) cos at cos ax dt x 
If /(.r) is odd, show that 

?lf(x -f 0) -f f(x - 0)] - ? f " rfa f" jf(0 sin a* sin ax dt z 

2. Find the Fourier transform g(a) of the function /(x) defined as follows :/(x) 
for x < 0, /(x) = I/a for ^ x ^ a, /(x) for x > a. Find lim g(a). 

3. Find the Fourier transform for /(x) denned as follows: /(x) =* for x < 0, 
/(x) - <r* for x ^ 0. Show that 

for x < 
* cos ax -f- sin ax , -w . _ 

ve~ x for x > 
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4. Consider g(a) - / e~ tat f(t) dt such that lim f(x) 0. Show that 

J- ar- 



g(a) = la 

6.17. Nonlinear Differential Equations. We discuss the nonlinear 
differential equation 

^) = M l (6-87) 

which can be written as the system 

dx 

= y 

(6.88) 



di =y 



di/ 

-j. = -x - rf(x, y) 

The solution of (6.87) f or M = is x = A cos (t + B), 
y = ^= -Asin(t + B) 

The method of Kryloff-Bogoliuboff is to vary A and J5 in the hope that an 
approximate solution to (6.88) may be found. This is essentially the 
variation-of-parameter method discussed in Chap. 5. 
We write 

x = r cos 8 

y = r sin B 

and obtain 

dx dr . . -dB 

-7- = -j- cos r sm 6 -rr 
dt dt dt 

dy dr . . . dB 

_ = __ sm< ,_ rcos 0_ 

Equations (6.88) can now be written 

-77 cos B r sin -77 = r sin B 
dt at 

-r- sin B r cos -77- = r cos M/(^ cos 0, r sin 0) 

(Zt dv 

dr 
so that -77 = n sin 0/(r cos 0, r sin 0) 

<W (6.89) 

-r j- = -r - n cos 0/(r cos 0, -r sin 0) ^ : 1 

For the range of values (r, 0) such that 

fj,f(r cos 0, r sin 0) <3C 1 
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one has to a first approximation 



(6 - M) 



Integration yields 6 = do + t. We replace of (6.89) by + t so that 
~ = M sm (0o + t)f(r cos (0 + t), -r sin (0 + 0) (6.91) 

Since -^- <C 1, the value of r will not change appreciably over the 
en 

interval (t, t + 27r). Integrating (6.91) on this basis [r is assumed con- 
stant on the right-hand side of (6.91)] yields 



r(t+2ar)-r(t) =M t * && (0o+0/fr cos (0 +0,-rsin 
H I" sin ^ f(r cos ^, r sin ^) d^ 

= MF(r) (6.92) 

r 2v 
where F(r) = I sin \l/ f(r cos ^, r sin ^) <i^ (6.93) 

The periodicity of sin \fr f(r cos ^, r sin ^) introduced the new limits of 
integration. Thus 

r(t + 27r) - r(Q M 

- a? - "S^W 

^ - -- 



represents the average change of r(t) for t ranging 

UK 

from t to t + 27T. Since -rr <K 1, the average change of r(t) can be replaced 

dr 
by -^r itself, at least to a first approximation. This yields 

^ = ~ F(r) (6.94) 

The integration of (6.94) yields the first approximation for r(t). The 
same method applied to the second equation of (6.89) yields 

- = 1 -f - / cos \f/f(r cos \I/. r sin ^) d\I/ 

(It ZiTTT I o 



and 9 = + I 1 + ^;<?(r) 1 < (6.95) 

with G(r) = I *cos^/(r cos ^-, rsin^)d^ (6.96) 
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Before applying Fourier-series methods to obtain improved approxima- 
tions, we discuss an example. 

Example 6.9. Van der Pol's equation is 



r(0 - r==- (6.97) 



Here f(x, y) = (1 x z )y, and 

f(r cos t, r sin ^) = (1 r 2 cos 2 ^)r sin ^ 
so that F(r) = r P* sin * ^(1 - r 2 cos 2 tf) d^ nr (l - - 

Equation (6.94) becomes 



whose solution is 



At t = 0, r(0) r . For r 5^ we have 

lim r(0 =2 

Since r 2 = x 2 -f 2/ 2 > we note that to a first approximation the motion in the phase 
plane ( x, y 37 j resembles a spiral motion into the limit cycle circle x 2 -f y 2 * r 2 = 4. 
From (6.95) we obtain G(r) 0, so that 0o -h t, which yields nothing new. 

To obtain an improved approximation to (6.87) we proceed as follows: 
Let 



sn 



x (t) = r(t) cos ^(0 + fux(r) + M a n (r) cos n 



*and let us attempt to make (6.98) a solution of (6.87). r(t) and 0(i) will 
be taken as the first approximation given by the solution of (6.94) along 
with (6.95). We shall attempt to find (r), a n (r), n (r), n = 2, 3, 
4, .... Terms involving /* 2 will be ignored. From (6.98) we obtain 



dx dr 



d6 \^ . n dd 

r sm -TT M 7 ttn sm nO -j- 
dt L^ dt 

00 

Zjn 
n$ n cos n0 j- (6.99) 
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mi da da dr F(r) da . - , , , r s j 

The term M TT = M -r- -TT = M -7^ T" 1S * tne order of M and so is 
dt dr at 2ir dr 

neglected. The same reasoning applies to the terms involving 

da n __ da n dr _ ju 2 da n , . 
M dt ~~ "Wdt ~~ Zx dr nrj 
dp n __ dp n dr __ M 2 dp n < 



Differentiating again yields 

j = -2 ~ -.7 sin - r cos 
dt 2 dt dt 

n -2 



sin n^? (6.100) 



Terms involving > - have been omitted since 



Moreover = 1 + G(r) 

at irr 



d< * 27T 
when we neglect terms of the order of M 2 - Substituting x(t) of (6.98) and 

d 2 r 

~ of (6.100) into (6.87) yields 

00 

M(r) - -F(r) sin - -C?(r) cos - ju / (n 2 - l) n cos nd 

7T 7T ~~q 



n2 



l)/3 n sin M0 = nf(r cos 0, -r sin 0) (6.101) 



n-2 



In the term p,f(x, x) we have replaced x by r cos 0, x by r sin 0, again 
neglecting terms of the order of M 2 - 

Since the left-hand side of (6.101) is a Fourier series, we expand the 
right-hand side of (6.101) in a Fourier series, 

00 00 

/(r cos 0, -r sin 0) = a (r) + V a n (r) cos nB + V 6 n (r) sin n0 

n-l n-1 
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1 {* 

with Oo(r) = H~ / f( r cos 0> ~~~ r 8 ^ n 0) ^ 
^ Jo 

1 P* 

a n (r) = - / cos nOf(r cos 0, r sin 0) dd n = 1, 2, 3, ... 
TT Jo 

i p* 

&n(r) = - / sin ndf(r cos 0, r sin 6) dd 
* Jo 

Equating coefficients of cos nO and sin nO in (6.101) yields 

p* 

F(r) = / sin B f(r cos 6, -r sin 6) dd 
Jo 

P* 
G( r ) = / cos 6f(r cos 0, r sin 0) d0 

yo 

a (r) = - J- / */( r cos 0, -r sin 0) d0 (6.102) 

ZTT Jo 



a n (r) = r~2 - rr / cos 710 /(r cos 0, r sin 0) dB 

ir(n L) JQ 9 ~ . 

n - J, tJ, 4, . . . 



r 2ir 

0n(r) = -7-5 - TT / sin nOf(r cos 0, -r sin 0) 

TT(n* 1) JQ 



dB 



The values of F(r), G(r) given by (6.102) agree with their previous 
definitions. It can be shown that x(t) given by (6.98) satisfies (6.87) 
with accuracy of order /u 2 for ^ t < <*> . 

Example 6.10. In Example 6.9, f(x, x) - (x 2 - l)i, so that 

(r s \ . r 8 

r J sin & sin 36 

Applying (6.102) yields 

(r) - 
(r) - n - 2, 3, 4, . . 

*(r) - - g 

/3(r) =0 n - 2, 4, 5, . . . 

Thus the improved first approximation is 

x(t) - r(0 cos (tfo + - M g sin 3(d + 
with r(0 given by (6.97). 

Problems 

1. The differential equation of a simple pendulum is & -f- (g/l) sin = 0. Take 
sin a ^ tf s /6, and show that the period of oscillation depends on the square of the 
amplitude. 

2. Solve x + x + iub\\ 0, M 1, for the improved first approximation. 

3. Solve 4- H- ^(sign a;)a: = 0, A( 1, for the improved first approximation. 
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4. Solve x -h x -h M( + /3x 2 )x = 0, M 1, for the improved first approximation. 
Show that a limit cycle occurs and that the radius of this limit cycle is r = \/4a/30. 
6. Consider the differential equation 

'x -f x + M x 2 = M 1 (6.103) 

written as the system 

dx 
-dt =y 

(6.104) 



---, 

such that, at t = , x = x, i/ = i/o, z = ZD. Tin 1 solution of x 4- x = is 

x(t) = A -f B cos (/ -f C y ) 

y = = /f gin ( -f- 0), 2 = -77 = B cos (/ -f O Tins suggests the trans- 
formation 

x w r cos 
y = 7 sin 
z = / cos 6 

in an attempt to find an approximate solution of (6 104) Let the reader slum that to 
a first approximation 

e = t + 
(h 
dt = WT 



~~ " ~ Wltil ^ + ^o) 2 = - ~ ^ 



and iu(0 = o ~~ " r ~ Wltil ^ 
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CHAPTER 7 

THE STIELTJES INTEGRAL, LAPLACE TRANSFORM, 
AND CALCULUS OF VARIATIONS 



7.1. Functions of Bounded Variation. In an attempt to define arc 
length of a curve one is led to consider functions of bounded variation. 
Let us consider a simple curve, F, given parametrically by x = f(t), 
y <p(), a ^ I ^ /3. Two distinct values of i are assumed to yield two 
distinct points on F, so that as t varies from a to 0, the point P with 
coordinates x = f(t), y <p(i) moves continuously from one end of the 
curve to the other without retracing its motion. We now subdivide the 
interval (a, 0) into 

a = to < ti < t z < ' < tk < t k+ i < ' < Jn-i < in = 

For t t k we have the point P k with coordinates x k = f(tk), yk = <p(4), 
Pit on F, A* = 0, 1, 2, . . . , n. The straight-line distance from 7V_i to 
P k is given by 



The sum total of these straight-line arcs is 

- ?(fc-i)] 2 }* (7.1) 



Now if for all manner of subdivisions of a ^ t ^ ft a constant A exists 
such that 



71 

I 



*"' (7.2) 



then <S W < \/2 A. Since the set of numbers {S n } is bounded, of neces- 
sity, a least upper bound (supremum), L, will exist for this set. We 
define L as the length of arc of F. For a discussion of the supremum see 
Chap. 10. Conversely, one easily shows that if the {S n \ are bounded for 



278 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

all subdivisions then a constant A exists satisfying (7.2) for all sub- 
divisions. The inequality of (7.2) leads to the following definition: 

Let f(x) be defined on the bounded interval a ^ x ^ I). If a constant 
A exists such that for all possible finite subdivisions of (a, b) into 

XQ = a < Xi < x 2 < ' ' x k < < x n -i < x n = b 
we have 

n 

T /(.r*) -/(A ,)| < A (7.3) 

W 
fc-1 

we say that /(x) is of bounded variation on a ^ x ^ b. 

A bounded monotonic nondecreasing or nonincreasing function is 
always of bounded variation. If f(x) is a monotonic nondecreasing func- 
tion, then 



!/(**) - /(**-i)| = [/(**) - /(**_,)] = /(&) - /(a) 



Thus (7.3) is satisfied for A > f(b) /(a). Another example of a func- 
tion of bounded variation is the following: Assume f(jr) has a continuous 
derivative for a ^ x ^ />. Then 



-/to-i)| = |to ~ A-i)/'(fe)| < M(x k - .r, 
since |f(x)| < Jf for a x b. Thus 

n 

I/to) - /to_0| < M k - ^-i = M(b - a) 



which proves our statement. 

Continuity of f(x) is not sufficient to guarantee that /(a*) is of bounded 
variation. For example, consider f(x) = x sin (l/:r), x 5^ 0, /(O) = 0, 
^x g 2/7T. Since 

lim /(x) = lim x sin - g lim |x| = = /(O) 

x- >0 x-0 ^ a 

/(x) is continuous at x = 0. Moreover /(x) is easily seen to be continuous 

/ 2\ / 2 2 2\ 

for x 7^ 0. Let us subdivide ( 0. - 1 into ( 0, ^ -----, ~ . . - ). 

\ ' 7T/ \ (2n + l)r 2n?r' ' TT/ 

Then 
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00 

V i 

Since / ^ r-y diverges, no constant A exists satisfying (7.3) for all 

Lj 2fc ~T~ 1 
fc = 

modes of subdivision. 

A fundamental result concerning functions of bounded variation is the 
following theorem : 

A necessary and sufficient condition that /(x) be of bounded variation 
on a ^ x ^ b is that f(x) be written as the difference of two positive 
moiiotonic nondecr easing functions. That the condition is sufficient 
follows almost immediately from previous considerations concerning 
moiiotonic nondecreasing functions. Now let us assume that f(x) is of 
bounded variation on (a, 6). Let x be any number on the interval (a, 6). 
Let us subdivide (a, x) into 

a = Xo < Xi < X2 < ' ' ' < Xk-l < Xk < ' ' ' < X n = X 

Then 

n 

)l < A (7.4) 



Some of the terms of 8 n are such that/(x r ) ^ /(x r _i), whereas other terms 
are such that f(x 8 ) < /(x_i). We write 

K n = P n + N n 

where P n is the sum of terms of 8 n for which f(x r ) ^ /(x,_i) and A T M is the 
sum of terms of S n for which /(x fc ) < /(x g __i). One easily shows that 
p n - N n = f(x) - /(a) so that 

S n = /(x) - /(a) + 2N n 

Sn = -/(X) +/(a) +2P n U ' J 

Since S n < A for all methods of subdividing the interval (a, x), we know 
from real-variable theory (see Chap. 10) that a least upper bound exists 
for the set [S n \. We call this least upper bound, F(a, x), the total 
variation of f(x) on the interval (a, x). The suprema (least upper 
bounds) of {N n } and {P n } are written as N(a, x), P(a, x), respectively. 
From (7.5) we have 



V(a,x) =/(*) -/(a) 
7(a, x) = -/(x) + /(a) + 2P(a, x) ' 

so that /(x) = /(a) + P(a, x) - AT(a, x) (7.7) 

From the very definitions of P(a, x), JV(a, x) we note that they are 
monotonic nondecreasing functions of x [see (7.6)]. /(a) + P(a, x) is 
monotonic nondecreasing, which proves the theorem. 
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Problem 1. Let f(x) = sin x for ^ x ^ 27r. Write f(x) as the difference of 
two monotonic nondecreasmg functions of x for this interval. 

Problem 2. If f(x) is of bounded variation and continuous for a ^ x ^ b, show 
that P(a, x) and N(a, x) are continuous for a ^ x ^ b. 

7.2. The Stieltjes Integral. An important generalization of the Rie- 
mann integral is the Stieltjes integral. The Stieltjes integral is defined 
as follows: Let f(x) and g(x) be real-valued functions of the real variable 
x for a ^ x ^ b. Subdivide the interval (a, b) into 

a = #o < Xi < Xz < - - - < x k -i < x k < < n = b 

and let 5 be largest of the numbers x k x k -i, fc = 1, 2, . . . , n. Now 
form the sum 

n 

)] (7.8) 



where ^ is any number such that j>-i ^ {* ^ j^. If lim 8 n exists 

7? > OO 

independent of the choice of the & and the method of subdivision, pro- 
vided 5 > 0, we call this limit the Stieltjes integral of /(x) relative to g(x) 
on (a, b), written 



f b f(x)dg(x) (7.9) 

Ja 



In the special case g(x) = x, (7.9) reduces to the Riemann integral. 

If f(x) is bounded and if g(x) is a bounded monotonic nondecreasing 
function of x, then S n of (7.8) satisfies the following inequality, 



m[g(b) - g(a)] g m k [g(x k ) - 

- g(a)\ 



k=l 



where m k is the infemum (greatest lower bound) of f(x) for x k -\ ^ x g 
0:^-1, AT* is the supremum (least upper bound) of /(#) for x fc _i g x g x^, 
m and M are the infemum and supremum of f(x), respectively, for 

n 

a ^ x ^ b. The supremum of the sums ) nik[g(xk) g(x k -i)] can be 

*-' 

called the lower Darboux-Stieltjes integral, L, and the infemum of the 

n 

sums ^ M k \g(xk) g(xk-i)] can be called the upper Darboux-Stieltjes 
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integral, U. If these two integrals are equal, we say that the Riemann- 
Stieltjes integral exists and write 



= [/= f b f(x)dg(x) 

Ja 



This definition is easily seen to be equivalent to the one given above. 

If f(x) is continuous in (a, 6), it is a simple matter to prove that L ~ U. 
Now if g(x) is a function of bounded variation, it can be written as the 
difference of two monotonic nondecreasing functions. It follows imme- 
diately that (7.9) exists if f(x) is continuous and g(x) is of bounded 
variation. 

Example!. 1. Let f(x) be continuous for ^ x 1, and let g(x) = OforO ^ x < -5-, 
g(x) = Ifor-g- ^ x ^ 1. For any subdivision not containing x = g we have dg(x] = 0. 
The subdivision covering x = ^ yields dg(x) 1. Thus S n /(), where is any 

number near x = \. Since hm /() = /(g-), we have / f(x) dg(x] = /(--) 

7J K ./O 

Problem 3. Let/(z) = J + i, i/(x) = x 2 for ^ x ^ 1 Show that 



f(x) dg(jr) =1 
Problem 4. If f(x) and <7'(^) are continuous for a ^ x ^ h, show that 

f b f(x] dg(x] = f b f(x)g'(x)dx 

Ja Ja 

The last integral is a Ricmann integral 

Problem 6. If f(x) and g(x) have a common point of discontinuity, show that the 
Stieltjes integral off(x) relative to g(x) does not exist provided the range of integration 
covers the point of discontinuity. 

Problem 6. If h(x) is nondecreasing, f(x) and g(x) continuous with f(x) ^ g(x), 

show that f b f(x) dh(x) ^ [* g(x) dh(x). 

Ja Ja 

n 

Problem 7. Let S n = Y g(h)[f(x k ) - /(arfc-i)], Xk-i ^ & ^ x k . Show that 



i 

n-l 



with #o = a, x, t = 6. Assume g(x) of bounded variation, /(#) continuous, and 
continuous at x a and x = 6. Show that 



Problem 8. Let /(#) = g(x} H- *^(a;) be continuous for a ^ x ^ fc, and assume 
a(x) /3(x) -f ^T(^) to be of bounded variation. Show that 



da(x) P fir(a;) d|3(a:) - T h(x) dy(x) + i f* g(x) dy(x) + i ^ h(x) 

Ja Ja Ja J a 
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Problem 9. Consider the sequence of continuous functions, /(x), n 1, 2, . . . 

00 

Assume y fn(x) converges uniformly for a ^ x ^ 6, and let g(x) be of bounded 
variation for this interval. Show that 

00 00 

f b Y V f b 

Jo, lm*i 1*4 Ja 

/oo 
f(x] dg(x), arid under what conditions 
i 
would this integral exist? 

7.3. The Laplace Transform. Let g(t) be a complex function of the 
real variable t defined for t ^ 0. Let g(t) be of bounded variation on the 
finite interval ^ t ^ R, R arbitrary. The function e~ zt with z = x + iy 
is continuous for all t, so that the integral 



(7.10) 
exists for all complex z. It may be that, for a given value of z, 

lim f* e~ zt dg(t) (7.11) 

exists. If this is the case, we write 

J o 6" dg(t) = Jim f* <r dg(t) (7.12) 

Equation (7.12) is called an improper integral, arid the right-hand side of 
(7.12) is called the Cauchy value of the improper integral. 
Those values of z for which (7.12) exists define a function of 2, written 

(7.13) 

f(z) is called the Laplace-Stieltjes transform of g(t). 

We consider now the region of z for which /(z) of (7.13) exists. First 

we investigate three special cases. Let g(f) = I e u du, so that g(t) is 

monotonic increasing, and hence is of bounded variation for ^ t ^ 72, 
R arbitrary. Since dg(t) = e et dt, we have 

fR , , . N , fR , , . fR , 

/ e -(*+*v)t e *' dt -. / e e<-xt cos yt dt - i e e xt sin yt dt 
Jo Jo J Jo J 

Since e* xt increases beyond bound for any fixed x, we leave it to the 

F R 

reader to show that lim / e~ zt e et dt fails to exist for all z. As a second 
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example, let g(t) = I e~~ u du. Let the reader show that lim / er** dg(t) 
yo # < JQ 

exists for all z. Finally, let g(t) - t, so that 

f R C R - -*& 

lira / e~ zt dg(t) = lim / cr zt dt = lim 



z . z 

provided x = Rl z > 0. Hence I e~ z ' ctt = /(z) exists for Rl 2 > 0. 

f 
Let us consider now a general case. We assume that / e~~ Zi)i dg(i) 

exists, with z = XQ + IIJQ. Further, let us assume that a constant A 
exists such that 

e-" dg(() < A (7.14) 



tor u S 0. Define h(u) by the equation 

h(u) = | o " c-'- dg(t) (7.15) 

so that dh(u) = p- z " </ST(M) and ^(0 = e 1 * dh(t). Then 

e- dg(t) = | o B e-f " dA) (7.16) 

Integration by parts (see Prob. 7, Sec. 7.2) yields 

[ K c~ 2t dg(t) = e~ (z ~ z )R f* cr*<* dg(t) + (z - z ) f* A(0c- ( " o) dt 
If Rl z > 111 2 , then lira e- (M8) * f B e- 20< dg(t) = 0, since 

^-^oo 70 



lim 

R-* oo 



and 



f R f-' dg(t) < A for all R. Moreover I f h(t)<r ( *- zo)t 



dt 



r lAWIc-^-'^'rf^ < A/(x - x ), for x > x . Allowing /f to become 
infinite in (7. Hi) yields 



(2 - 20) 
for Rl z > Rl 20. Thus 



f(z) = r-" rfj/(/) (7.17) 

exists for Rl 2 > Rl 20. We have shown that, if (7.17) converges for 
2-0 = #o + iyo, then/(2) of (7.17) is well defined f or 2 = # + iz/ provided 
u 1 > XQ. There may be singularities of /(z) on the line js = XQ + iy, 
< y < oo , and in the half plane Rl 2 < x . 
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The ordinary Laplace transform of f(f) is defined as 

F(z) = L[/(0] = erf(t) dt (7.18) 



provided the improper integral exists. If the Laplace transform of f'(f) 
also exists, one can integrate by parts to obtain 



W(*)] = L ^"/'(O dt = e-'/(0 + z / e-/(0 d* 

JO 70 /y jgx 

jLF/*'^)] = zZJ/Yrtl /(O) v'* Xty / 

provided 

lim e-f(t) = (7.20) 

Equation (7.20) will certainly hold if \f(f)\ is bounded for t ^ and if 
Rl z > 0. Further application of (7.19) yields 

L[/"(0] = zL[f'(t)] ~ /'(O) 

= z*L(f(t)] - zf(0) -/'(O) (7.21) 

If F(2) is the Laplace transform of f(t), we say that /(/) is the inverse 
Laplace transform of F(z), written f(t) = L" 1 ^^)]. 

Example 7.2. We find the Laplace transform of sm at. We have 

F(z) = I e~ zt sin at dt 
Integration by parts yields 

/e~ zi sin at dt - ^.-~ (~~z sin at a cos at} 
) a 2 -f z 2 t=o 

If Rl 2 > 0, we have 

r oo a 

I e~ zt sin at dt ; , 

Jo a 2 H- 2 2 

Table 7.1 lists Laplace transforms for some simple cases of /(). 

Problem 11. Derive the result* of Table 7.1. 

Problem 12. Show that L[af(t) + bg(t)] = aL\f(t)] -f bL[g(l)]. 

Example 7.3. Let us consider the differential equation 



subject to the initial conditions y(0) = i/o, 2/'(0) = t/o- Assuming that the solution 
of (7.22) and its derivatives are such that their Laplace transforms exist, we can apply 
(7.19) and (7.21) and the result of Prob. 12 to obtain 

z*L[y(t)\ - zy* - y' + 3zL[y(t)] - 3y Q - 4L[y(t)} - 



,,, rr /A1 (z o Q 

so that L[y(t)] = . , . w - - - - - -- -h 



(z -f4)(z -1) 5 z + 4 5 2-1 

3/0- 2/o rr mi , 4 2/o + 2/o 

- L[<pi(t)] + 

5 5 
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From Table 7.1 we see that <f>i(t) e" 4 *, <f> z (t) e*, so that the suggested solution of 

(7.22) is 



y(Q 



5 



(7.23) 



One easily checks that y(t) of (7.23) is the required solution. It is to be noted that the 
Laplace-transform method for solving (7.22) introduces the initial conditions in a 
natural manner. 



TABLE 7.1 





/(/) 


F(z) - fa e-/(0 eft 


1 


1 


i. HI 2 > 




a<lt 


. PI ** pi .- 






z a 







a t Pi ? v. n 






z s + 02 ' Ki * > 


A 




z , "Ri - > n 






2 2 + 2' K1 2 > 




' 


a . T?l - >, l/l 






2 2 _ fl2 ' 1U Z > l a l 






2 . , PI - >s |/,| 






2 2 _ fl 2 f K1 Z > l a l 


7 


t . 


2 Pi >^ n 




2a 


- (z i + fl .)i' K1 z > 




1 


pi - v. n 






(2 2 + .)' R1 2 > 


9 


r d\ 


(V^ 2 -f i - *Y 






Vz* + 1 


10 


t n-l 

(n - lj 


4> Rl 2 > o 

2" 



Problem 13. Solve -j-| + 2^ = l~3a; by the Laplace-transform method, 

0, 7/'(0) 1. 
Problem 14. Let /(Q = for < 0. Show that 



I 



- *) dt 



/" " 6~^/( - 05) d< e- I * e~ rf 



provided the integrals exist. 
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Example 7.4. Let, us attempt to solve the wave equation ---\ = -^ ~ 

subject to the boundary conditions y(x, 0) = ~ - - for all x, and y(Q, t) = f(t) 

for t ^ 0. We assume y(, t) Inn y(x, t) ~ for t ^ 0. Physically, we have 

a?-* * 

an elastic string from x ~ to x = <, initially at rest. At the origin the string is 
constrained to move in such a manner that y(Q, t) = /(/), f(t) a given function of time. 
If we multiply the wave equation by e~ xt and integrate from t to t oo f we obtain 

~ 8 f* y(x, 0*-" dt = -* z*L\ii(x, t}} 
provided we assume T-^ / y(jc, t)er zt dt = / ' - --- e'^ dt. Thus Lf?/| satisfies 

" X J J Q f > X 

,L(y] = J /,(//] (7.24) 

A solution of (7.24) is 

* , t)c- dt - Xe ( / e ) + Be-<">* (7,25) 



f 



where ^4. and B can be functions of z. Since t/(ar, tends to zero as x becomes infinite 
we choose A - 0. At x * we need 



f 



e'" dt 
From Prob. 14 we have 

f " 2/(a;, Oe"" rt dt - c~ w / c /" " /(O*"* 

Equation (7.26) suggests that 2/(z, /) - /(/ - a;/c) for t ^ a?/c provided f(t - /c) - 
for t < x/c (see Prob. 14). It is a simple matter to show that/(* x/c) satisfies the 
wave equation and the boundary conditions. Of course one needs the fact that f(t) be 
twice differentiable. 
Example 7.5. We wish to find the function f(t) such that 



ftt)e-dt - (2 2 - 1)-! (7.27) 

00 

Let us assume that f(t) has the Taylor-series expansion. / a n t n . Without justifica- 

nt'o 

tion let us assume that term-by-term integration is permissible. Hence 



n0 

Now for \z\ > 1 we know that 

1 ^ 1 ^ V (2m)! 1 

x/inri " 2 Vi - (i/2) 2 * ^ 

Wl ^ U 
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Comparing the two Laurent series, we see that a n =* if n is odd and, if n 2m, 

- - jjjoam' 80 that 

oo 

= y ^-j~j QV ( ? - 28 ) 



We note that/(tO == f ~f~T (~\ = Jo(t), where J" (0 is the Bessel function of 

order zero of the first kind. One can start with/(2) of (7.28), justify the interchange 
of integration and summation, and show that (7.27) results. 

Problems 

1. Solve ^r* + y = sin x with j/(0) = 0, j/'(0) = 0. 

2. Solve ( -~ 2 - 4// = 3e zx with y(0) = 0, y f (0) = 1. 

3. Solve 2x ^ + ~ - 2z, 2/(z, 0) - 1, t/(0, - 1, for y(x, t). 

Am. y(x, t) 1 + for < t < x* y y(x, t) - 1 + x 2 for * > x 2 . 

4. By the inversion formula (see Sec. 7.4) solve for/(Z) if 



Ans. f(t) - J sin a* (see Table 7.1). 

5. From Prob. 4 show that 

r oo 2 2 a 2 

/ e~~ zi t cos a< CM = T~T~I ?T 9 

yo (2 2 + o 2 ) 2 

6. Find f(t) such that 

I* e-J(t)dA - . 2 Z 

7.4. The Inversion Theorem. Let 0(0 satisfy the requirements which 
enable one to write 

1 t * T 00 

0W = o- / dy / ^(0 cos y (^ "" dt (7.29) 

zrj - J-oo 

(see Sec. 6.16). We assume that I * f(w) dw converges absolutely, and 
choose ^(0 = e-**f(() for t ^ 0, flf(0) = for t < 0. Since 

= o~ / dv I g(t) sin v(w f) dt 
2arJ-* J- 

we have 

i / . iw r- 

-i r+-*]' 

Z*J- Jo 



O 
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Now assume 

" dt 



exists for Rl z x > 0. Then 

IV F(z)e* dz = ( x+iv e> dz f " e-'f(t) dt 

iy Jx~iy JO 



= e w 



(7.31) 



on letting z = x + iv, v a new variable of integration. On letting y 
become infinite and comparing (7.31) with (7.30) we see that 



/<) = 0= 



F(z)e wz dz 



(7.32) 



If F(z) is the Laplace transform of f(w), (7.32) enables one to find /(it?) 
in terms of F(z). This is the Laplace-transform inversion theorem. 
Equation (7.32) can often be evaluated by the calculus of residues. 

Example 7.6. Let F(z) = l/(z + 1) 
Then from (7.32) 

I- 1 e wt , 




with x > 0. 



We consider U) 



dz around 



the path given m Fig. 7.1. Let the reader 
show that, as the radius of the semicircle 
becomes infinite, the integrals of e wz /(z + 1) 
tend to zero along BC, CDE, EA. The resi- 
due of e m /(z + 1) at z = -1 is e~ w . Thus 

f(w) = 2~^ I2irt~"1 * *~~ 

7.5. The Calculus of Variations. 

The calculus of variations owes its 
beginning to a problem proposed by 
Johann Bernoulli near the completion 
of the seventeenth century. Suppose 
two points to be fixed in a vertical 
plane. What curve joining these two points will be such that a particle 
sliding (without friction) down this curve under the influence of gravity 
will go from the upper to the lower point in a minimum of time? This is 
the problem of the brachistochrone (shortest time). A problem of a 
similar nature is the following: What curve joining two fixed points is such 
that its rotation about a fixed line will yield a minimum surface of revolu- 



FIG. 7.1 
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tion ? This is the soap-film problem. A third problem asks the following 
question: What curve lying on a sphere and joining two fixed points of the 
sphere is such that the distance along the curve from one point to the 
other is a minimum? This is the problem of geodesies. 

Let us obtain a mathematical formulation of these problems. 

1. Brachistochrone. Let the particle P move along any curve given by 
y = y(x) (see Fig. 7.2). The speed of the particle is given by v z 2gy 

or -y = \/2g y*. Hence 



ds 



+ dy 1 



T~(2/T 2 



dx 



The total time of descent is given by 



7> = -!_ 



+ (y'Y' 



dx 




(7.33) 



FIG. 7.2 



To solve the brachistochrone problem, one must find the function y(x) 
which makes (7.33) a minimum. 

2. Minimum Surface of Revolution. If the curve y(x) lying above the 
x axis is rotated about the x axis, the surface of revolution generated in 
this manner is 



S = 27T y ds = 27r 



(y'Ydx 



(7.34) 



To solve the soap-film problem, one must find y(x) such that S of (7.34) 
is a minimum. 

3. Geodesies of a Sphere. In a Euclidean space we have 

ds* = dx 2 + dy 2 + dz z 
For a sphere x = r sin cos <p, z/ = r sin sin ^, 2 = r cos 0, so that 

ds 2 - r 2 (d# 2 + sin 2 d^ 2 ) = r 2 1 + sin 2 (^j dd 2 . 



If ^ <^?(0) is any curve on the sphere, the distance between two points 
of the sphere joined by ^ = <p(8) is given by 



L = 



dO 



(7.35) 
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To find the geodesies of a sphere, one attempts to find <p = <?(6) such that 
L of (7.35) is a minimum. 

Formulas (7.33) to (7.35) are special cases of a more general case. Let 
f(x, y, y') be a function of the three variables x, ?/, y 1 '. We wish to deter- 
mine y = y(x), and hence y' = y'(x), such that 



= f n f(x,y,y'. 
j 71 



) dx (7.36) 

will be an extremal (minimum or maximum) subject to the restriction that 
y(xi) = //,, yfa) - 2/2. For (7.33), (7.34), (7.35) we have 



f(x, !/, y') = 2wy[l + (2/') 2 P, /(*, 2/, 2/0 = r A/T+sin 2 z(2/') 2 dx with 
x = 6, y = <p, respectively. 

Let us see how a problem in the calculus of variations differs from an 
extremal problem of the ordinary calculus. In the latter case we are 
given the function y = y(x). For each real x there corresponds a unique 
real number y. Thus y = y(x) maps a set of real numbers into another 
set of real numbers. The relation y = I/a:, < x g 1, maps the 
interval < x ^ 1 into the interval y ^ 1. A simple problem in the 
ordinary calculus is to find a number x which yields the minimum or 
maximum value of y y(x). Now (7.36) may also be looked upon as a 
mapping. For any function y(x), (7.36) defines a real number /. Thus 
(7.36) is a mapping of a function space [the space of y(x)] into the real- 
number space. Our problem is to select a member of the function space 
which yields a minimum or maximum value for / of (7.36). To resolve 
this question, we reduce the problem to one of the ordinary calculus. 

7.6. The Euler-Lagrange Equation. Let us assume that there exists a 
function y(x) which makes I of (7.36) an extremal. We now consider the 
class of functions 

Y(x, X) = y(x) + \n(x) (7.37) 

where 77(2) is an arbitrary differentiate function such that r\(x\) = 0, 
r?(x 2 ) = and X is a real parameter. We have Y(x\, X) = y(xi) = T/I, 
F(# 2 , X) = y(x%) = 2/2. For X = we have Y(x, 0) = y(x). As we vary 
X and rj(x), we obtain a family of curves passing through the two 
given points PI(XI, y\}> Pi(x^ 2/2). Moreover X = yields the desired 
curve, y = y(x). The value of / for any member of (7.37) is 

/(X) = f **f(x, y + Xn, y 9 + XT/') dx (7.38) 

Jx\ 

For any fixed rj(x] we know that 7(X) is an extremal for X = 0. From the 



calculus, of necessity, 
we have 
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/)/ r)/ 

= 0. Assuming continuity of and . 
x~o dy dy' 



d\ 



^-r>dx 






51 fXt J / 3f 

_ / (| 
J XI dx \dy' 



(7.39) 

^ 



upon integration by parts. Since rj(xi) = 77(0:2) = 0, we have 



it is a simple matter to show that, if M(x) is continuous on Xi ^ x ^ x^ 
then the vanishing of / 2 M(x)ij(x) dx for art)itrary ??(x) implies M(x) = 0. 

yri 

We leave this as an exercise for the reader. Hence if y(x) is the required 
solution, of necessity, y(x) must satisfy the differential equation 



(7.41) 



_- 

da; \dy'J dy 



Equation (7.41) is the important Euler-Lagrange equation. It can be 
written as a second-order differential equation in the form 



a 2 / 



I i/ ./ *-* { j i ^ 
dy 'fy'dx dx~ 



_ =n 

y' dy 



(7.42) 



If we differentiate / y 1 - 1 -, with respect to x along the curve y(x) 
satisfying (7.41), we obtain 



dx dij a//' 

by makiiifi; use of (7.41). Thus 

d 



dy' 



dx\dy' ~ dx 



(7 ' 43) 



for an extremal. If/ is explicitly independent of y, one has a first integral 
of (7.41) given by -, = constant. If / is explicitly independent of x, 

j- = and (7.43) yields the first integral, 

/ - y' ~ = constant (7.44) 
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One may obtain the Euler-Lagrange equation from the following point 
of view: First let us consider the equation y(x) = x* 2x, so that 



y(x 



dx) = (x 



- 2(x + \ dx) 







Differentiating with respect to X yields = 3(x + X dx) 2 dx 2 dx, so 

c/A 

= (3x* 2) dx = dy, th. differential of y. Now 



dy 

we have seen that (7.36) can be looked upon as a mapping of a function 
space into a real-number space. We call this mapping a functional of y, 
written 7 = I[y], The first variation, or differential, of 7 may be 
defined as 



= lim 



OA 



(7.45) 



where 8y(x) is any variation in y(x). Applying (7.45) to (7.36) yields 

I[y + \8y]= \ f(x, y + \dy,y'+\ dy') dx 
' 



dl 



= /"' 

Jxi 

r 

7 8y' dx by parts yields 



(7.46) 



Integrating 



provided 8y(xi) = 8y(x 2 ) = 0^ Equation (7.41) holds, of necessity, if 
67 = for arbitrary dy. 

In the calculus it is necessary to examine the second derivative of y(x), 
and, at times, higher derivatives, in order to determine the type of 
extremal (maximum, minimum, point of inflection) encountered at the 



point x 



fij 
for which -^- = 0. Similarly, in the calculus of variations 



it is necessary to examine the second variation in 7 to determine what 
type of extremal is obtained from the solution of the Euler-Lagrange 
equation. Let the reader show that the second variation of 7 can be 
written as 



527 - 






5y ' 






dx (7 - 47) 
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Example 7.7. Equations (7.33) and (7.34) are special cases of 



/ = / g(y) vi + (y')*dx 

We have/ = g(y) [1 -f (y') 2 ]l, ~ = 0, so that a first integral is obtained from (7.44). 

ax 

Let the reader show that (7.44) yields 

f ^L==^. = constant = a 

From tan = - we have cos l/\/l -j- (?/) 2 so that 

</(*/) = a sec (7.48) 

Moreover gr'(?/) dy = a sec tan d0 from (7.48), and dx = cot dy so that 



yielding 



a sec dO 

dx = ---- 77 \~ 

g (y) 



-6 + (7.49) 

Jo g'(y) ^ ' 



Given 0(t/), one solves (7 48) for y as a function of 0. Then integration of (7.49) yields 
j as a function of 0. This parametric representation of x and y as functions of yields 
the required curve which extrernahzes /. 

In the brachistochrone problem g(y) = y~~%, so that ?/ = r cos 2 = (r/2)(l -f cos 20), 
c = I/a 2 , from (7.48). Thus /(*/) = -|// * = (-2r* cos 3 0)~ l , and (7.49) yields 

re re 

-- b - 2c \ cos 2 0d0 = b - c (1 -f cos 20) d0 
Jo Jo 

x(o) = 6 - C (20 + sin 20) (7.50) 

7/(0) = I (1 + cos 20) 

Equation (7.50) is the parametric equation of a cycloid 

Example 7.8. Vanable-end-point Problem. We are given the fixed curve T, 
y = (p(x), and the functional 



fx 2 = b 

1 = / F(x, y, y') dx 

Jxi 



\Ve wish to find the curve y y(x) joining the fixed point A(XI, y\) and B(b, <f>(b)), 
where B is a point on F such that 7 is an extremal. The coordinate x = b is unknown. 
If we consider the curve y(x) -f dy(x), we have 

fe y 4- *y, 2/ r H- fy') ^ 

The upper limit has changed since the end point of y(x) + 8y(x) is constrained to lie 
on y = ^>(#). Let the reader show that 



+ 8y] - /[y] - [F(x, y + by, y' + by') - F(x, y, y')} dx 
Jxi 



F(x, y -f- By, y' + y') cte (7.51) 
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From (7.51) it is logical to define 57 by 

SI = f b (~ dy + |^ By'} dx + F(x, y, y') dx (7.52) 



Now dy = at x = a. We must compute by at x = b. We have 
y(b + dx) -f dy(b + Sx) = <p(b + dx) 

y(b) = <p(b). Thus y(b -f SJT) - y(b) + dy(b + dx) = <p(b + te) - *>(&). Applying 
the law of the mean one easily shows that 

= [v'(b) - y'(b)} dx 

b 
Integrating the second part of the integral of (7.52) by parts yields 

-JL'K-(S)] *<+['+ 1 "-">]..." 

If 6/ =0 for arbitrary 8z/, of necessity 

\F + (^' - y') IJ1 =0 (7.53) 

L o^ Ja; = 6 

along with the Euler-Lagrange equation. 

Let us apply (7.53) to the problem of the minimum surface of revolution with <p(x) 

dF 
arbitrary. We have F(x, y, y') = y\l + fe') 2 ]^, ^7 = yy'[l -f fe') 2 ]^, so that (7.53) 

becomes 

!?/[! -f (y')*\* + yi/'[i + (v'W-W - y f )\*-i, = o 

or t/ V = 1 at x = 6. Hence, at their point of intersection, y(x) and <p(x) intersect 
at right angles. 

Problems 

1. Show that the solution of the soap-film problem is y = a cosh - 

2. Show that the geodesies on a sphere are arcs of great circles. 

3. Derive (7.47). 

4. Show that the Euler-Lagrange equation for the functional 



- -. 

dx* 7 / dx \dy'J dy 

5. Show that there are no extremals or stationary values for the functional 

l[y] = / T2 y dx 

Jxi 

6. Consider the functioniil 

dt 



/ f(xi, x%, . . . , x, xi t x 2 , . . . , x n , t) 
Jt\ 
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Show that the Euler-Lagrange equations are 



Explain why it is necessary that 

f(Xi, Xi, , Xn t XJl, XJ* 2 , , X.T rt , = A/(:ri, 0-2, . . , X M , J'i, J i, . . . , X n , t) 

7. Apply (7 54) of Proh 6 to 



8. In Chap. 3 we saw that Lagrange's equations of motion could be written as 



- 

dt \dqtj dqi 

/tz 
L dt is an extremal for Newtonian 

motion. For a conservative system, T -f V = constant = h Show that New toman 
motion for a conservative system is such that / 2 T dt is an extremal subject to the 
condition T -f V = constant 

9. Consider the functional I[z] = II F I x, y, z, J dy dx, the region of 

S 

integration, S, having the simple closed curve C as its boundary. Show that, foi 
f{z] to be an extremal, z = z(x, //) must satisfy 

dF _ d Z F _ J>*F_ 
dz dx Op dy dq 

dz dz r , 

with p = ~> q = in F 

r dx ^ dy 

7.7. The Problem of Constraints. In the ordinary calculus one solved 
problems of the following type: Given the function of two variables, 
z = f(Xj y), at what point P(x, y) is z an extremal subject to the con- 
straint <p(x, y) = constant? We Enow that if /(x, 2/JT.s continuous on 
the closed and bounded curve C, ^(x, y) = constant, there will exist points 
on C at which z takes on minimum and maximum values. One way to 
solve this problem is to solve for y from <p(x y y) = constant, obtaining 
y = \l/(x), and then to extremalize z = /(x, ^(x)), a one-dimensional prob- 

lem. Another method is due to Lagrange. For an extremal, -j- = 0, 

dx 

so that + -^~ -r- = 0- This same equation can be obtained as follows: 
dx dy dx 

Consider w = /(x, y) + X<p(x, z/), X a parameter. Compute and 
as if x and y were independent variables. Setting these two partial 
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derivatives equal to zero yields 



Eliminating X yields 



dx dx dx 

^ = ^- + X^ = 
dij dy dy 



| 



dx dy \ d<p/dy 
, y 

~- 



Along the curve <p(x, y) = constant, we have -^ + -JT ~T- = 0, so that 



~r--j 
dx dx d<p/dy 



and Laqrange's method of X multipliers yields 



TT- + - -y = 0, the required equation for an extremal. 
dx dy ax 

The simplest constraint problem in the calculus of variations is the 
following: We wish to extremalize the functional 

/[?/] = j b f(x,y,y')dx (7,55) 

Ja 

subject to the constraint 

/ <P( X , y, y'} dx = constant (7.56) 

Ja 

As in Sec 7.6 we let F(x, X) = y(x) + X^^x) + X 2 T7 2 (#), 

and we desire to extremalize 

/(Xi, X 2 ) = / f(x, y + XIT?! + X 2 ?72, y f + XirjJ + X^) dx 

Ja 

subject to the condition 
/(Xi, X 2 ) = / <P(XJ y + XIT?I + X 2 r?2, y' + X^i + X 2 77 2 ) dx = constant 

Ja. 

/(Xi, X 2 ) is to be an extremal at Xi = X 2 = 0. 
Let the reader show that the Euler-Lagrange equation becomes 

dx \^dy f J dy ' 

The solution y(x, X) of (7.57) is substituted into (7.56) to eliminate X. 

f a 
Example 7.9. Let us find y(x} such that the functional I[y] ~ / y dx is an 

r a 

extremal subject to the constraint / v 1 + (y') 2 dx = constant = b. Since the 

constraint states that the length of arc of the curve v(x] be a constant, this tvne of 
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problem is called an isopenmetric problem. We have / + \<p = y + X \/l + (?/') 2 > 

(f + X*>) = _ J^L=, A (f + A*,) = 1 

< Vi + e</') 2 d - ?/ 

so that (7.57) yields 



__ 

<fo L VI + (?/ 

or y"/[l + (?/') 2 ] 3 = 1/X. We recall that y" /[I + (y')*\l is tlie curvature, so that 
y(x) has constant curvature, and hence must be an arc of a circle. The radius of the 

circle is X, which can be obtained from / \/\ -f- (y')i dx = b. 

Example 7.10. If the Riemanman metric (see Sec. 3 6) is extremahzed subject to 

/*i dx a 

<p a (x l , x 2 , . . . , x n ) j ds, where <p a , a == 1,2, 3, 4, is the electro- 
: (18 

magnetic vector potential, one obtains the motion of a charged particle in a gnivitn- 
tiorial arid electromagnetic; field, 

d*x* dx> dx k e dx a e 

d? + r ^ "^ W + m * * ~fc ~ ' X = m 

Problems 

rb 

1. Find y(x) which exlremalizes the functional /[//) = / (?/) 2 dx subject to the 

Ja 

fb 

constraint I xy dx = constant. 

Ja 

2. Find the curve of constant length joining two fixed points with the lowest center 
of mass. 

3. The area underneath the curve y = f(x) from x = a to x = b is rotated about 

/b 
y dx constant = c . 
i 

4. Derive (7.57). 
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CHAPTER 8 

GROUP THEORY AND ALGEBRAIC EQUATIONS 



8.1. Introduction. The study of groups owes its beginning in an 
attempt to solve algebraic equations of degree higher than 4. The linear 
equation ax + b = 0, a 7* 0, has for its solution x = b/a. The solu- 
tion of the quadratic equation ax 2 + bx + c = 0, a ^ 0, is known to be 

~ 4ac). We 



note that x\ and x* are written in terms of a finite number of operations 
involving addition, subtraction, multiplication, division, and root extrac- 
tions. The operations are performed on the coefficients of the quad- 
ratic equation. We say that the quadratic equation is solvable by radi- 
cals. The cubic and quartic equations are solvable also by radicals. 
Lagrange attempted to extend this result to algebraic equations of degree 
higher than 4. He was unsuccessful, but his work laid the foundation 
which enabled Galois and Abel in the early part of the nineteenth century 
to grapple successfully with this problem. In general, an algebraic equa- 
tion of degree higher than 4 cannot be solved by radicals. The equation 
x 6 1 = can be solved by radicals, however. It remained for Cauchy 
to systematically begin the study of group theory proper. The theory of 
groups plays an outstanding role in the unification of mathematics. Its 
applications in mathematics are widespread, and, moreover, it has served 
an important role in the development of the modern quantum theory of 
physics. 

8.2. Definition of a Group. ^ We consider first some elementary exam- 
ples of groups. Let us consider the set of all rationale, excluding zero, 
subject to the rule of multiplication. We note that the product of two 
rationals is again a rational. If a, b, c are rationale, then the associative 
law, (a6)c = a(6c), holds. The unique rational 1 has the property that 
1 a = a 1 = a, f or all rationals a. Finally, for any rational a, there 
exists a unique rational, I/a = or 1 , such that ocr 1 = cr*a = 1. Let the 
reader show that the four elements (1, 1, i, i) possess these same 
properties under the operation of multiplication. Let us consider the 
90, 180, 270, and 360 rotations in a plane about a fixed point. 
Let us denote these rotations by AI, A%, A 3, and A 4 = E, respectively. 
By AiAi we mean a 90 rotation followed by a 180 rotation, etc. We 
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note that A*A,- = A*. A 270 rotation followed by a 180 rotation is 
equivalent to a 450 = 90 rotation. Moreover A>E = EAi = Ai foi* 
i 1, 2, 3, 4. E is the identity element in the sense that the rotatldij 
E leaves a body invariant. We also note that A\A* = A *A\ = Ey 
A 2 A 2 = E, EE = E, so that every element has a unique inverse. From 
the fact that 



let the reader deduce a correspondence between the rotations discussed 
above and the four elements (1, 1, f, i) under multiplication. The 
examples above lead us to the formal definition of a group. A discussion 
of sets can be found in Sec. 10.7. 

Let G consist of a set of objects A, B, C, . . . . An operator, , is 
associated with every pair of elements of G, (A, B) = A B. For con- 
venience we call the operator <8> multiplication and write A B = AB. 
The set G is said to be a group relative to the operator if: 

I. For every A and BofG,AB = C implies C is a member of G. This 
is the closure property under <S> . 

II. For all A, B, and C of G, 

(AB)C = A(BC) 

This is the associative law. 

III. There exists a unique element, #, of G, such that 

AE = EA = A 

for all A inG. E is called the identity, or unit, element. 

IV. For every A of G 1 there exists a unique element, written A"" 1 , such 
that 

AA~ l = A~ 1 A = E 

' We call A~ l the inverse of A. It follows that A is the inverse of A"" 1 . 

One can replace III and IV by: 

III'. For every A and B of G there exist unique X and Y of G such that 
AX - B, YA - B. 

By choosing 1? = A we see that every element A has a unique right 
and left identity, from (III'). Thus AEi = A, E 2 A = A. We show 
now that EI = jE 2 . Now A(S 2 A) = A A = (AJE 2 )A, so that A A# 2 
from (III'). Hence # 2 #1 from (III'). We show next that the iden- 
tity element for A is the same as that for B for all A and J5. We have 
AE A - E A A - A, 5#* - # B B - B. NowB(E B A) - (5#*)A - 5A, 
so that, from (III'), EaA = A, which implies EB JS^. Let the reader 
show that (III') implies (IV). 
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Example 8.1. Let x and y be elements of a set such that x* - e, y 3 = e, yxy = x. 
We consider the set of elements (x, y, y 2 = y - y, xy, xy 2 , e), e the unit element. Let 
us construct a multiplication table for these elements. We write x x = x 2 , y - y y*, 
y . y . y = i/ 3 , etc. From x 2 s= e we note that x is its own inverse, and from y 3 = e we 
note that t/ 2 is the inverse of y. If we desire to compute (xy*)x, we note that 

(xy z )x = (xy*)(yxy) = xy*xy = xexy = xxy =* x*y = ey = y 

(see Table 8.1). 

We note that each row and column of Table 8.1 contains the six elements of our set 
with no repetitions. If x ^ e, y ^ e t let the reader show that the six elements are 
distinct, and hence form a group. 

TABLE 8.1 





e 


x 


y 


y 2 


xy 


xy z 


c 


c 


x 


y 


y 2 


**'/ 


xy 2 


X 


x 


e 


*y 


xy 2 


y 


y 2 


y 


y 


xy* 


y 2 


e 


x 


xy 


y* 


y 2 


xy 


e 


y 


xy 2 


x 


xy 


xy 


y 2 


*ll 2 


x 


e 


y 


xy 2 


*y* 


y 


X 


xy 


y 2 


e 



Problems 

1. Verify Table 8.1. 

2. Consider the set of square matrices, ||a l; ||, i, j = 1, 2, . . . , n, such that 
|a t ,| 5^ 0. Show that this set of matrices is a group under multiplication. 

3. An Abelian group is one for which AB = BA for all A and B of G. Is the group 
of Example 8.1 an Abelian group? Show that the group of Prob. 2 is non- Abelian. 

4. A finite group is one containing a finite number of distinct elements. Show that 
we can replace (III) and (IV) by (III"): AB - AC implies B - C, and BA = CA 
implies B = C, for finite groups. 

5. We define a * a = a 2 , a a a~*= a 3 , etc. A group is said to be cyclic if a single 
element generates every element of the group, that is, an element a exists such that, 
if x is any element of the group, then x = a n for some positive integer n. Give an 
example of a cyclic group. 

6. Show that A - (A" 1 )" 1 . 

7. If A and B are elements of a group G, show that (AB)' 1 = B" 1 A" 1 . Generalize 
this result. 

8. Show that the set of rational integers (positive and negative integers including 
zero) form a group relative to the operation of addition. Do the set of rational 
integers form a group relative to the operation of multiplication? 

8.3. Finite Groups. A finite group is a group consisting of a finite 
number of distinct elements. We deduce now some theorems concern- 
ing finite groups and illustrate each theorem with an example. 
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THEOREM 8.1. The order of a subgroup is a divisor of the order of the 
complete group. The order of a group is the number of distinct elements 
of the group. H is a subgroup of G if H is a group and if, furthermore, 
every element of // belongs to G. If at least one member of G is not a 
member of //, we say that H is a proper subgroup of G. The proof of 
Theorem 8.1 is as follows: Let H consist of the elements E, A, B, . . . , F. 
If // s= G, there is no problem. Assume H a proper subgroup of (?, and 
let X be any element of G not in H. Construct the set HI of elements 
XE, XA, XB, . . . , XF. Let the reader show that these elements are 
distinct. Moreover, if XA = B, then X BA~ l is a member of H 
since H is a group. But X is not a member of // so that XA ? B. 
Hence every element of HI is distinct from every element of H. If the 
members of // and HI exhaust (?, then g = 2/i, where g is the order of G 
and h is the order of H. If this is not the case, let F be a member of 
G not in // or HI. We now construct the set II 2 consisting of YE, 
YA, . . . , YF. One easily shows that the members of 7/2 are distinct 
from each other and are distinct from the elements of H and HI. If 
//, HI, HZ exhaust G, then g 3h. If not, we continue in the same 
manner. Eventually we must exhaust G since G has a finite number of 
elements. Thus g = nh, and h divides g. 

Example 8.2. The group of Table 8.1 consists of six elements. A subgroup of this 
group is H(x, x* = e). The order of H is 2, and 6=23. Another proper subgroup 
of G is K(y, y 2 , y* = e), 6 =3-2. Is it possible for G to have a subgroup of order 4? 

THEOREM 8.2. Every subgroup of a cyclic group is a cyclic group. The 
definition of a cyclic group is given in Prob. 5, Sec. 8.2. Let G consist of 
A, A 2 , . . . , A = E. Let H be a proper subgroup of G with elements 

A\ A b s . . . , A 2& , . . . , A r = E b < 1), < - . < r 
Since 61 > 6, we have b\ = qb + s, ^ s < b. Then 

yj 61 __ ^ 



and A bl ~ qb = A 8 . If s ^ 0, then A 8 is a member of H, a contradiction, 
since b was assumed to be the smallest exponent of A such that A b is 
in H. Hence s = 0, and 61 = qb = 2b, since A b A b = A 2b is in H. The 
only elements of // are of the form A mb , so that // is cyclic. 

Example 8.3. Let G be a cyclic group of order 8, so that 
(a, a 2 , a 3 , a 4 , a 5 , a 6 , a 7 , a 8 = e) 

are the elements of G. Consider the subgroup //(a 2 , a 4 , a 6 , a 8 = e). We note that H 
IB cyclic since a 4 = (a 2 ) 2 , a 8 = (a 2 ) 3 , a 8 = (a 2 ) 4 . 
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THEOREM 8.3. A criterion for a subgroup is the following: Let G be 
a finite group, and let S be a subset of G such that the product of any 
two elements of S is again an element of S. Then S is a subgroup of G. 

Certainly the closure and associative properties hold for S. Now 
let A be any element of S. Then A, A 2 , . . . , A T , . . . belong to S. 
There can exist only a finite number of distinct elements of the type A k . 
Thus A n = A m for n > m y and A n ~ m = E belongs to S. Moreover 
AA n -- 1 = E, so that A n ~ m ~ l = A~ l belongs to S. Q.E.D. 

Example 8.4. Let us consider the permutations of the integers (1, 2, 3). \\e 
obtain the six permutations (1, 2, 3), (1,3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), (3, 2, 1). 
We can consider the particular permutation (2, i-J, 1) as being obtained from a sub- 

(123\ 
/ 

In this way we obtain the six elements 

123\ /123\ fm\ /123\ /123\ n /123 



/123\ /123\ A23\ 

= \132/ Sl = V213/ &4 = \231/> 



If we consider a triangle with vertices labeled 1, 2, 3, respectively, then $6 states that 
we interchange the labels 1 and 3 and leave label 2 invariant. Let us consider any 
function of three variables, f(xi, x%, xa) The operation of 82 on f yields 

tfjfCri, 3*2, rs) = /(.f i, jj, 0*2) 
If we follow this }yy the operation *S 6 , we obtain 

Nftjf(.r], J'a, J^.t) = $bf(f}, ft, xt) = /(o- 3 , x-2, j*i) 
since >S6 permutes 1 into 3, 2 into 1, and 3 into 2. Thus 



and it is natural to define *S f 6J S 2 = ^Se, written ( ' ) ( 1097 ~ \Wi/" ^ ; can '^ 
upon ( ' ) ( *|' ) as follows: Starting with the right-hand side, we see that 1 goes into 

\ol2/ \lo-6/ 

1 , then, moving to the left, we see that 1 goes into 3. The final result is the permuta- 
tion of 1 into 3. Again, on the right-hand side, 2 goes into 3, and, moving to the left, 
3 goes into 2, so that the end result is to leave 2 invariant. 3 2 followed by 2 > 1 

\ iclds 3 * 1. The product yields ( ' 1 = Si. It follows that Si plays the role of 

the identity element of the group, and we leave it as an exercise to show that the ele- 
ments Si, i 1 , 2, . . . , 6, do, indeed, form a group relative to multiplication 
defined above. The order of the group is 3! = 6 Let the reader obtain a generaliza- 
tion for the permutation group of order n\ often called the symmetric group. 
We consider now the function 

f(Xi, 2, Xa) = Oi - 3- 2 )(j2 - X.i)(Xt 0*i) 

We note that Sif = /, Stf - -/, SV = -/, *S</ = /, Xt>f = /, Stf - -/, so that Si, 
St, arid S& leave / invariant. These elements are called the even permutations. It is 
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a simple matter to show that the product of two even permutations is again an even 
permutation. Hence, from Theorem 8.3, the set (S\, St t $5) is a subgroup of the 
symmetric group of order 3!. Do (82, Sz, $e) form a group? These are the odd 
permutations. 

Any subgroup of a symmetric group is called a regular permutation group. 

Problems 

1. Show that the elements of Example 8 4 form a group. Construct the multiplica- 
tion table for this group Is this group Abelian? Construct all the proper subgroups. 

2. Show that the product of two even permutations is an even permutation. Con- 
sider the cases of the product of an even with an odd permutation and the product of 
two odd permutations. 



3. We can write S 4 = oo = (m) m the SOIlse tlmt ] * 2 > 2 * 3 H 



can be written *S Y 6 = (1^)(2) Do the same for $i, $2, Nj, #5 

4. Show that the permutation group of order n ] contains a subgroup of order n!/2. 

6. C is called the commutator of two elements A and B of a group if C = (AB)~ 1 BA. 
For any element X of G show that A"" } ('X is the commutator of X~ 1 AX and X~ 1 BX. 

8.4. Isomorphisms. Let us consider two groups G\ and (7 2 . If a 
one-to-one correspondence can be established such that 

,1 <--> A' 
B <-> B f 
implies 

AB<-> A'B' 

for all A and B in 6 T , we say that the two groups G } and (7 2 are isomorphic 
to each other and write Gi = (V 2 . A', B', , . . belong to GV 

An isomorphism of two groups implies that the two groups are equiv- 
alent in the sense that we are using two different languages to describe 
the elements of the groups. It is apparent that any theorem obtained 
from the fact that 6 Y i is a group will also hold for (? 2 . 

Example 8.5. We consider a cyclic group of order 4 with elements A, A z , A 3 , 
A 4 = E along with a subgroup of the symmetric group of order 4' Let the reader 
show that the elements 

1234\ /1234\ /1234 



form a cyclic, group with 82 = flj, S.\ <S Y ], S* = E S\. It is a simple matter to 
show that the correspondence 

8,*-* A 1 i = 1, 2, 3, 4 
is an isomorphism. We leave this as an exercise for the reader. 

An important theorem due to Cayley is stated as follows: 
THEOREM 8.4. Every finite group G is isomorphic to a regular permuta- 
tion group. Let the elements of the group be written as E = Ai, A%, 
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. . . , A n . Let A k be any member of (?, and consider the elements 
A k Ai 9 A k A<i, . . . , A k A 3 , . . . , A k A n 

Since A* A \ belongs to (?, it must be equal to an A r , and by A k \ we mean 
A r . Similarly A k A% = Akt, etc. Let A k correspond to 



'' L) 



for k = 1, 2, . . . , n. We wish to show that (8.1) represents an iso- 
morphism. We have from (8.1) 



l 2 ' ' ' ? '\ 

l J'2 ' ' ' jn) 






/I 2 w \ / 1 

VI J'2 ' ' JM/VA-1 



which establishes the isomorphism. The isomorphism of Example 8.5 
was established in this fashion. 

An automorphism is an isomorphism of a group with itself. A simple 
example of an automorphism is as follows: Let A^ i 1, 2, . . . , n be 
the elements of a group (?, and let X be any element of G. For conven- 
ience we choose X ^ E. Let us construct the elements X~ 1 A 1 X^ i 1, 
2, . . . , n. If G is Abelian, we have X~ 1 A,X = ^ 1 X~ 1 X = A,E = 4 t . 
Let us assume that (7 is non-Abelian. Generally, then, X~ 1 A*X ^ A,. 
It is a simple matter to prove, in any case, that X~~ 1 A 1 X T X"~ 1 A J X f 
i ^ j, for if X~ 1 A,X = X~ l AjX, then ' 



a contradiction if i 9* j. Since the set S of elements X~ l A t X, i = 1, 
2, . . . , n, contains n distant elements and since every member of S 
belongs to G, S == G. We show now that the correspondence 

A t <-> X- 1 A,X i = 1, 2, . . . , n (8.2) 

is an automorphism. We have A 3 <-> A^ 1 ^^, 



which proves the automorphism of the correspondence (8.2). 

Problems 

1. If GI and (?2 are isomorphic and if G\ is cyclic, show that G* is cyclic. 

2. Find the regular permutation group which is isomorphic to the group of Table 8. 1. 
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3. Show that all cyclic groups of order n are isomorphic (see Prob. 1). 

4. An automorphism of the type (8.2) of the last paragraph is called an inner 
automorphism. Show that the only inner automorphism of a cyclic group is the 
trivial automorphism, A, <-> A t , i = 1, 2, . . . , n. This is called the identity 
automorphism. 

6. Prove that the product of two automorphisms is an automorphism. Hint: If 
A* <-> A( is an automorphism and A t < > A/ is an automorphism, then under the 
second automorphism we have A( <-> (A|)", so that A T <- (A,')" is another corre- 
spondence, called the product of the two automorphisms. Show that A t < )> (A t )" 
represents an automorphism. 

6. Show that the inner automorphisms of a group form a group. 

7. Show that the automorphisms of a group form a group 

8. Find the inner automorphisms of the group given in Table 8.1. 

8.5. Cosets. Conjugate Subgroups. Normal Subgroups. Let G be a 
group and H a proper subgroup of G. Let the elements of 77 be HI, 7/ 2 , 
. . . , 77 m , and let A be an element of G not in H . We consider the ele- 
ments HiA, H 2 A, . . . , H n A. The reader can quickly verify that 
H>A * H 3 A for i ^ j, and H t A ^ H,. If H t A = H,, then A = H~ 1 H 3 
is in 77, a contradiction. Thus the set of elements H 1 A J i = I, 2, . . . , 
n, are distinct and do not belong to 77 if A does not belong to 77. This 
set is not a subgroup of G since it does not contain the identity element 
E. We call this set a coset, written as HA . If the elements of 77 and 
HA exhaust G, we can write G = H + 77A. If not, we consider an 
element B of (7, B not in II and HA. We construct the coset HE and 
omit the proof that the elements of 777? are distinct from the elements 
of H and 77 A. We can continue this process until we exhaust (?, if G 
is a finite group. Thus 

G = 77 + HA + IIB + - - + HC (8.3) 

Example 8.6. We consider the symmetric group of order 4!. Let H consist of the 

(1 2 3 4\ 
01 A >3 ) Now 
2 1 4 o/ 



SiA 
S%A 
03 A 

$4 A 


/1234\ /1234\ /I 234\ 
V23417 V21437 V321 47 
A234\ /1234\ /1234\ 
V3412/ V2143/ \43217 
/1234\ /1234\ /1234\ 
\41 237 \2 1437 VI 4237 
/1234\ /1234\ /1234\ 
Vl 234M21 437 \21 437 



Note that SA 5^ >S,A, i ^ j, and that S % A ^ S } for all t, j. The elements /SA, 
i 1, 2, 3, 4, are members of the coset HA. The element 5 = ( . J is not a 
member of H or HA. We consider 
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_ /J234\ /1234\ _ /1234\ 

***** - \234lM2 41 3} ~ \3 12 4/ 

/1234\/1234\ /1234\ 

5sfls " V3412A2413/ = V4231/ 

_ /1234\ /1234\ _ /1234\ 

^ ~ Vl 1 2 3/ \24137 ~ V1342/ 

/1234W1234\ _/1234\ 

&4/ * = V1234A2413/ ~ V2413/ 

The elements &#, t = 1, 2, 3, 4, are members of the coset HB. Note that S t B 5* 5, 
for all ?, j, , 7* S,.l for all z, .;, &# 7* S,# for i 7* j. Continuing, we can obtain 
G = H + HA + HB + HC + ED -f- HF. 

THEOREM 8.5. The dements X and Y belong to the same coset if and only 
if XY~ 1 is in II. Assume XY~ 1 a member of //, so that XY~ l = HI, 
H l in H. Then .Y = //,7, and H,X = HJ],Y = H,Y for all f, with 
fly = HJHi. Thus every member of II X is a member of HY. From 
7 = H^X, H,Y = HJI^X = II k X for all i, we have that every mem- 
ber of # F belongs to HX. Thus flX = 7/F. Conversely, if HX = HY, 
then HI and ff 2 exist such that HiX = H 2 Y so that XY~ l = H^H^ 
a member of fl. Q.E.D. 

DEFINITION 8.1. If B = X~ 1 AX, we say that B is conjugate to A, 
with 4, B, X in G. We note the following: 

1. Every element is conjugate to itself, A E~ 1 AE. 

2. If B is conjugate to A, then A is conjugate to B, since B = X~ 1 AX 
implies A = -XjBJST- 1 = (Jf- 1 )- 1 ^- 1 . 

3. If ^1 and B are conjugate to C, then A and B are conjugate. We 
have 

A = X~ 1 CX B = F-VF = Y-*(XAX-i)Y - (A^ 1 F) 

Let us consider a subgroup H of G consisting of the elements E, A, 
B, . . . . Let .Y be a member of G not in PI. We form the elements 
X~ 1 EX = E, X~ 1 AX, X- 1 BX, ... and designate this set as X~ 1 HX. 
Since 

(X- 1 AX)(X-*BX) - X~ l (AB)X 

we have from Theorem 8.3 that the set X~ 1 HX is a subgroup of G. We 
have used the fact that // is a group, which implies that AB is in H if 
A and B are in //. We call H and X~ 1 HX conjugate subgroups. By 
considering Y^HY, etc., one can construct the complete set of conjugate 
subgroups of H. 

DEFINITION 8.2. If // is identical with all its conjugate subgroups, then 
H is called a normal, or invariant, subgroup of G. 
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Example 8.7. The even permutations of a symmetric group form a subgroup, H. 
If X is any odd permutation, X~ 1 AX is an even permutation if A is an even permuta- 
tion. Thus X~ 1 HX = H for all X so that H is a normal subgroup of G. 

THEOREM 8.G. The intersection of two normal subgroups is a normal 
subgroup. By the intersection of two subgroups HI, 77 2 we mean the 
set of elements belonging to both HI and 77 2 , written HI P\ H%. It is 
obvious that the identity clement E belongs to HI C\ 77 2 . If A and B 
belong to HI C\ 77 2 , then AB belongs to HI and to 77 2 so that AB belongs 
to 77 1 Pi # 2 . From Theorem 8.3, H l C\ 77 2 is a subgroup of the finite 
group G. We must show now that 77 1 C\ 77 2 is a normal subgroup if HI 
and //2 are normal subgroups. The set of elements X"~ l (Hi C\ H^X 
belong to HI since any element of HI C\ 77 2 belongs to 77 1 and, moreover, 
HI is normal. The same statement applies to 77 2 , so that every element 
of X~ l (H l C\ H 2 )X belongs to //i Pi H 2 . Thus H L C\ 77 2 is identical with 
all its conjugate groups so that H i Pi 77 2 is normal. 

Problems 

1. The elements ^ 2 '^J ^ ^ ^ j form a subgroup // of the symmetric group of 

order 3!. Find the right-hand resets II X of G. Find the left-hand cosets XH of G. 
Show that H is not a noi mal subgroup of G 

2. Show that the subgroup //, consisting of the elements <S, / = 1 , 2, 3, 4 of FAamplo 
8.5, is a normal subgroup of the symmetric group of order 4' 

3. If H is a subgroup of a cyclic group G, show that H is normal. 

4. A group G is said to be simple if it contains no proper normal subgroups. Show 
that all groups of prime order arc simple Show that all simple- Abehari groups are of 
prime order. 

6. Let M and N be normal subgroups of G containing only the identity E in com- 
mon Show that if M i and N\ are any two elements of M and A r , respectively, then 
MiNi = NiMi. Hint Consider C = Mi l N^ l MiNi, and show that C = E. 

6. Consider the set of elements E, Ai, A 2 , . . of G such that A~ l XA l = X for all 
X of G. Show that this set is an Abehan subgroup of (7, called the central of (7. Show 
that the central of G is normal. Find the central of the group given by Table 8.1. 
Find the central of the symmetric group of order 3'. 

8.6. Factor, or Quotient, Groups. The set of rational integers form 
a group relative to addition (see Prob. 8, Sec. 8.2). The zero element 
plays the role of the identity element. A proper subgroup of this group 
is the set of even integers including zero. The odd integers yield a coset 
of the group of even integers, so that 7 = 7 + /i, where 7 is the group 
of rational integers, 7 is the group of even integers, and I I is the set of 
odd integers. 7i is obviously not a group relative to addition since 7 t 
does not contain the identity element. Let the reader show that 7 is a 
normal subgroup of 7. If we add any element of 7 to any other element 
of 7 , we obtain another element of 7 . We may thus write 7 + 7 = 7 . 
If we add any element of 7 to any element of 7 b we obtain an element 
of 7i, so that 7 + 7i = 7, + 7 = Ii. Finally 7 X + 7i = 7 . Thus if 
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we abstract in the sense that we denote all even integers by the single 
element 7 and denote all odd integers by the single element 7i, we note 
that the two elements 7 , I\ form a group relative to addition, with 7o 
serving as the identity element. We generalize this result as follows: 

Let N be any normal subgroup of the group G. We consider N along 
with its cosets, 

. . . ,NA r (8.4) 



Since N is normal, we have X~*NX ~ N, that is, every element of the 
set X~ 1 NX belongs to N, and, conversely, every element of N belongs to 
X-WX. Thus NX XN, and NA t A t N, i = 1, 2, . . . , r, so that 
the right cosets of N are equivalent to the left cosets of N. Let us con- 
sider the product of two cosets of N. We have 



(NA t )(NA 3 ) = N(A 1 N)A, N(NA.)Aj = NN(A,A 3 ) = 
with N(A % A 3 ) another coset of N. Conversely, 

N(A 1 A 3 ) = NN(A 1 A J ) = N(NA V )A, (NA t )(NA,) 



If we look upon a coset as a single element, we note that the set (8.4) 
forms a group. The element TV is the identity element for this group 
abstracted from the group G and the normal subgroup N. 

DEFINITION 8.3. The group whose elements are constructed from the 
normal subgroup N of G and the cosets of N is called the factor group, or 
quotient group, of N, written G/N. The order, or number of elements, 
of G/N is called the index of the factor group. 

THEOREM 8.7. Let 77 be a proper subgroup of G, written H C G. If 
N is a normal subgroup of G such that TV C 7/ C G, then H/N C G/N. 
Let the reader show that, if N is a normal subgroup of G, then A" is a 
normal subgroup of the subgroup //, provided N C 77 C G. Thus 

H = N + NA 1 + NAz + + NA. 
andG/77 = (N, NA ly . . . , NA.). Now 

G = N + NA 1 + NA, + + NA, + NX + - - - + NY 

with G/N = (N, NA,, . . . ,NA.,NX, . . . ,NY). It is obvious that 
G/H C G/N. 

DEFINITION 8.4. The product HK of two subgroups 77 and K of G is 
the set of elements h t k 3 , where hi ranges over H and k 3 ranges over K. 

THEOREM 8.8. If 77 and K are normal subgroups of G, their product 
L = HK is a normal subgroup of G. First we show that HK is a sub- 
group of G and that HK ~ KH. Certainly the associative law holds 
since H and K are in G. Moreover kihjcr 1 = h s since H is normal, 
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so that kihz = h^ki and (/h/Ci)(/i 2 /c 2 ) = (hih^)(kik^) = hk C 77JL The 
identity element of HK is e, the identity element of G. We leave it to 
the reader to show that HK ~ KII and that the inverse of every element 
of HK is again an element of HK. Finally, for any element X of G, 

X~ l (HK)X = X- l (HXX~ l K)X = (X~ 1 HX)(X- 1 KX) 777 

since 77 and K are normal. Thus 777v is identical with all its conjugate 
subgroups so that HK is normal. 

DEFINITION 8.5. By a maximal normal group N of G one understands 
that N is not contained properly in any normal subgroup of G other than 
G itself. 

THEOREM 8.9. Let N \ and N% be normal maximal subgroups of (?, D 
their intersection, written D = NI P\ AV Then 

GYAT, ^ A/VZ> G/N* ^ AT 2 /7;> (8.5) 

The intersection of two subgroups of G is the set of elements common to 
both NI and AY D is nonvacuous since the identity element obviously 
belongs to D. The reader should refer to Sec. 8.4 for the definition of an 
isomorphism. 

From Theorem 8.8 the product NiNz is a normal group. Obviously 
NiNz contains both NI and N z . Since NI and A T 2 are maximal, it is 
necessary that NiNz = G. Since any subgroup of a normal group is 
normal, it makes sense to speak of N } /D. Now 



Ni = D + DAi + DA Z + - - + DA r (8.0) 

with A, 7* A 3 for i -^ j, and the cosets DA, arc distinct. Let L be the 
set of elements belonging to N%, NzA lf i = 1, 2, . . . , r, so that 

L = N* + NzAi + N Z A Z + + NiA r 
We show first that L = G. We have 

G = NiN 2 N 2 N 1 = N,(D + DA^ + - + DA r ) 



since ]V 2 ^ ~ N%. The cosets A^ 2 A t , z = 1, 2, . . . , r, are distinct, for 
A^A ~ A r 2 ^4 J implies N^A^Aj- 1 A^ 2 , which further implies that A 1 A J '~ 1 
is a member of A" 2 . Moreover A 1 A J ~ 1 is a member of NI since A, arid 
^ belong to NI [see (8.6)]. Thus A 1 A J ~ 1 belongs to both NI and A^ 2 
and hence to D. This implies DA 1 A~ 1 D, 7)^ t ~ 7)^4y, a contra- 
diction to (8.6). The correspondence of cosets, DA, -> A^ 2 ^4 t , i = 1, 
2, . . . , r, with 7) <- A^ 2 yields the isomorphism JVi/D = G/N 2 . Simi- 
larly JV 2 /D ^ G/Ni. Q.E.D. 



310 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

Example 8.8. We consider the cyclic group G of order 6 with elements a, a 2 , a 8 , a 4 , 
a 6 , a 6 = e. The three proper subgroups of G are Ni(a z , a 4 , a 6 = e), AT 2 (a 3 , a 6 = e), 
Ns(e). Let the reader show that A r i and A^ are maximal normal subgroups of G. We 
also have that D = N* JVi O A^. We have 



. 

G - (a 2 , a 4 , e) + (a 2 , a 4 , e)a = (a 2 , a 4 , e 
G = (a 8 , e) + (a 8 , e)a + (a 3 , e)a* = (a 3 , 

G/tf ! = [(a 2 , a 4 , 6), (a, a 3 , a')] 

G7tf 2 = [(a 3 , e), (a 4 , a), (a', a 2 )] 

tf,/D = [(a 2 ), (a 4 ), ] 

AT 2 /D = [(a 3 ), (e)] 



e) + (a, a 3 , a 6 ) 
3 , e) + (a 4 , a) + (a*, a 2 ) 



The correspondence (e) < (a 2 , a 4 , e), (a 3 ) < > (a, a 3 , a 6 ) is an isomorphism, N^/ D == 
G/Ni. Similarly N\/D ~G/N> 2 By (a 2 , a 4 , e) (a, a 3 , a 5 ) we mean all elements 
obtained by multiplying elements of (a 2 , a 4 , e) with elements of (a, a 3 , a 6 ). Note that 
(a 2 , a 4 , e) (a, a 3 , a 6 ) = (a, a 3 , a 6 ) so that the element [(a 2 ), (a 4 ), (e)] is the unit element 
of G/N i. 

Problems 

1. Find the maximal normal subgroups of the cyclic group of order 12. Consider 
any two such maximal normal subgroups, and show the isomorphism (8.5) by con- 
structing their cosets and finding their intersection. 

2. Find the normal subgroups of the group of Table 8.1. 

3. A group is said to be simple if it has no normal subgroup other than itself and the 
identity element Show that any group of prime order is simple. 

4. Show that G/H is simple if and only if // is maximal 

6. Let N be a normal subgroup of G, H any subgroup of G. Let D = H C\ N, 
L HN. Show that HN is a group, D a normal subgroup of H, L/N ^ H /D. 
6. Find the maximal normal subgroups of the symmetric group of order 24. 

8.7. Series of Composition. The Jordan-Holder Theorem. Let N be 
a normal subgroup of G. One then obtains the factor, or quotient, group 

G/N = [N, NA l9 NA^ . . . , NA r ] 

where NA lt i = 1, 2, . . . , r, is a coset consisting of the elements N,Ai 
with Nj ranging over the group N. Let T = [N, NBi, . . . , NB 8 ] be a 
normal subgroup of G/N. We prove the following theorem: 

THEOREM 8.10. Every normal group of a factor group G/N yields a 
normal group of G; each normal group of G which contains Af corresponds 
to a normal group of the factor group. The first part of Theorem 8.10 
states that the set 

H = N + NB 1 + + NB t 

is a normal subgroup of G. Now 



X~ 1 HX = X~ l (N + AT^! + + NB 8 )X 
= X~ 1 NX + 



Since N is normal, we have X~ 1 NX ^ N for all X in G. Since T is nor- 
mal, (NX^-^NB^NX is in T. Thus X~ l (BiN)X is in T and hence is an 
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element of H. Thus X~ 1 HX is an element of H for all X of (?, which 
proves that H is normal. It is obvious that G D H D N. Now assume 
that G D H D N with both // and AT normal. From 



H = N + NA l + NA 2 + - - + NA 8 

G = N + NA, + NA 2 + - + NA. + NA, +1 + + NA r 

it follows that H/N C G/N. We wish to show that H/N is a normal 
subgroup of G/N. From the fact that H is normal and that A % belongs 
to H we have that A~ 1 A^A 3 belongs to H. Thus 



~ (A- l AtAj)N since N is normal 
A^A-M^j) since N is normal 

= N(NA k ) AM* 

Thus the conjugates of H/N are members of H/N so that ///AT is 
normal. It may be that A~ 1 A T A 3 = E, the identity element, so that 
N(A~ l A t A,) = NE = N. Q.E.D. 

DEFINITION 8.6. Let Ni be a maximal normal subgroup of (?, N* a 
maximal normal subgroup of NI, etc. We obtain a series G, JVi, N 2 , . . . , 
N k = E, called a series of composition. 

The factor groups G/N\, Ni/N^ . . . , N k -i/N k are all simple groups 
(see Prob. 3, Sec. 8.6) for the definition of a simple group. If N % /N l+ i 
were not simple, Theorem 8.10 states that a normal group M would 
exist such that N l 3 M D JVt+i, a contradiction, since N l+ i is assumed 
maximal relative to Af z . These simple groups are called the prime factors 
of composition of G. The orders of G/Ni, Ni/N z , . . . are called the 
factors of composition of G. 

THEOREM 8.11. (Jordan-Holder). For two series of composition of a 
finite group G the prime-factor groups are isomorphic. 

This means that if we consider two arbitrary series of composition of 
G, say 

G = //O, //l, //2, - , H r = E , _, 

G = /Co, X i, X-2, . . . , K 8 = E 
with prime-factor groups 

Hi/Hi, . . . , Hr-i/H r 



then r = s and a one-to-one correspondence, i 4-4 ^ can be set up such that 

HJH^ ^ K } /K J+l (8.9) 

with i and j ranging from to r. 
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The theorem is evidently true if the order of G is prime, for in this case 
there exists only one normal subgroup of G, the identity element, and 
G/E = G/E. The proof of Theorem 8.11 is by induction. Assume the 
theorem true for any group G whose order, g, can be written as the prod- 
uct of n prime factors or less. We know that the theorem is true for 
n = 1. Now let G be any group whose order can be written as the prod- 
uct of n + 1 prime factors, arid let us consider two arbitrary series of 
composition [see (8.7) and (8.8)]. If H 1 = KI, then G/H l ^G/K l and 
the set of elements of HI is a group whose order contains at most n prime 
factors. By hypothesis H T /H l+ i ^ K } /K J+} for i ^ 1. We assume, 
therefore, that HI and KI are distinct maximal normal subgroups of G. 
From Theorem 8.9 

G///I ^ K./D, G/Ki ^ H l /D l (8.10) 

where DI = HI C\ KI. Since G/H i and G/Ki are simple, of necessity, 
Ki/Di and Hi/Di are simple. Hence DI i,s maximal (see Pro}). 4, Sec. 
8.6). Now form the series of composition 

G, // 7>n ;; 2| . . . , /> w = E 

G, A'i, DI, D 2 , . . . , D m = E 1S - IU 

From (8.10) and the fact that DJDz^D^D*, D 2 /D 3 ^ D. 2 /D 9 , etc., 
we see that the two series of (8.11) satisfy Theorem 8.11. But the series 
(Hi, H%, . . . , HJ, (Hi, DI, . . . , D m ) satisfy Theorem 8.11 by assump- 
tion, as do the series (Ki, K 2 , . . . , K r ), (Ki, DI, . . . , D TO ). Thus 
Theorem 8.11 holds for the two series (Hi, H 2 , . . . , H r ), (Ki, K 2 , . . . , 
K r ), so that with (8.10) it is seen that the Jordan-Holder theorem holds 
for any group G whose order can be written as a product of n + 1 prime 
factors. This concludes the proof by induction. Example 8.8 is an 
illustration of the Jordan-Holder theorem. 

Problem. Construct the multiplication table for the octic group whose elements 
are u l v j (i - 0, 1, 2, 3; j = 0, l)^with u* = 1, v z = 1, vu = u*v Find the different 
series of composition and prime-factor groups for the octic group, and show that the 
Jordan-Holder theorem applies. 

8.8. Group Characters. Representation of Groups. If to each ele- 
ment s of a group G we can assign a nonvanishing number, real or complex, 
written X(s), such that 

X(s)X(t) = X(st) (8.12) 

we say that we have a one-dimensional representation of the group and 
we call X a group character. From (8.12) we have 

X(s)X(e) - X(se) = X(s) 
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so that X(e) = 1, where e is the unit element of the group G. A trivial 
group character is X(s) 1 for all s of G. 

Example 8.9. Let G be a cyclic group of order n with generator a, a n == e. Let us 
define X(a r ) by the equation 

X(a r ) = ^2*,/ rt )r r = 1, 2, . . , n (8.13) 

We have A"(a r )A(a 8 ) = ?<2,n/) V (2T,/). = f ,(2ir,/)cr+.) = A(a r +'), so that (8.13) defines 
a character of the group A set of characters A\ A\ . . , X n -\ can be defined by 

A'^(a r ) = e< 2 "" ir M = 0, 1, 2, . . . , n - 1 (8.14) 

Let us note that 



(8.15) 

?=1 r=l 

The group characters of (8 14) are orthogonal in the sense that (8.15) holds Moreover 

n - 1 n-l 

AVa-)A->') - J <'"'"><- - o If' ; ^ I } (8^) 

M=0 M=0 

Let us now consider a set of n X n matrices AI, A 2 , . . . , A n and a group G with 
elements /i, </2, . , fin We assume that a one-to-one correspondence exists such 
that 



implies 

A t A ; <-> g t g, 

If su(;li is the case, it is easy to show that the set ol matrices is a group, A, isomorphic 
to the group G. We say that the group of matrices, A, is a representation of the 
group G. Any matrix A = ||aj || can be thought of as representing an affine (linear) 
transformation y l = a]x ] (see Chap. 1), so that a representation of a group implies 
that a group of afhne transformations is isomorphic to the group G. Let the reader 
show that, if g % <-> A t is a representation of G, then g t <- B~ 1 A 1 B is also a representation 
of G, |B| ^ 0. 

Example 8.10 A representation of the symmetric group of order 6 is easy to con- 

struct. The identity permutation ( ' ) can be looked upon as the transformation 
of coordinates x = x, y = y, z = z, or 

x = l:r + O// + 02 

y = Ox + I// + Oz 

z = Ox + ()(/ + \z 

yielding the unit matrix 

1 

Ai = 1 

1 

(1 2 3\ 
' J can be looked upon as the transformation of coordinates 
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x = ij = Ox 4- l/y + Oz 

y = x = 1 X + 0/y -I- 0,2 

z = z = Oj- + O// -f ^ 



yielding the matrix 



1 
100 
1 



I.et the reader obtain the matrices corresponding to the permutations 

Since every finite group is isornorphic to a regular permutation group (see Theorem 
8.5), it follows very simply from Kxample 8 10 that one can always obtain a representa- 
tion of a finite group. 

It may happen that a matrix B exists such that B^^B has the form 

t = 1, 2, . . . , n (8.17) 



where B r (A,), r = 1,2,. . . , /c, are matrices If this is so, the original 
representation is said to be reducible. One easily shows that the set of 
matrices {B r (Ai), B r (A 2 ), . . . , B r (A n )j is a representation of G. Thus 
(8.17) yields k new representations of G. It may be that further reduc- 
tions are possible. If not, the representation given by (8.17) is said to be 
irreducible. In this case the correspondence is written 








B 2 (A,) - 












B,(A,) 



+B 2 (A,) 



(8.18) 



where the sum in (8.18) denotes the matrix (8.17). Methods for deter- 
mining the irreducible representations of a group are laborious. The 
reader may consult the references cited at the end of this chapter for 
detailed information on irreducible representations. 

Problems 

1. Derive (8. 16). 

2. Find a representation for the group of Table 8.1 

3. If the correspondence g l <-> A, is a representation, show that X(g % ) = |A| is a 
character of the group G. 

4. Find a character for the group given by Table 8.1 other than the trivial character 
X(g l ) = 1 for all g % of G. 



8.9. Continuous Transformation Groups. 

formation of coordinates given by 



Let us consider the trans- 



(8.19) 
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where t is a parameter ranging continuously over a given range of values. 
The transformation (8.19) will be said to be a one-parameter transfor- 
mation group if the following is true: 

1. There exists a to such that 

* = /Or, //;M 
y = <p(x, ?/; to) 

This value of t, t to, leaves the point (x, y) invariant. This corresponds 
to the identity transformation. 

2. The result of applying two successive transformations is identical 
to a third transformation of the family given by (8.19). This implies 
that if x 2 = f(xi, 2/1 ; O, 2/z = ^(^i, yi' 9 ti) then 

za = /(/Or, /y; 0, v>(z, /y; ; fi) = /Or, /y; J 2 ) 
z/2 = <K/(-^ /y; t), <p(x, /y; /); Zi) = ?(:r, //; / 2 ) 

where 2 is a function of t and ]. 

3. The associative law must hold. 

4. Each transformation has a unique inverse. This implies that we 
can solve (8.19) uniquely for x and y such that 



?y = (f>(xi, 2/1 ; /i) 
with ^i some func^tion of ^. 

Example 8.11. The translations j-j = x + />, //i = y H- 2/> satisfy the requirements 
for a one-parameter group The identity transfonnation occurs for b = 0, and the 
inverse transformation is obtained by replacing b bj r b 
The rotations 

j*i = jc cos 6 t/ sin 
?/i = .r sin + // cos 

form a one-parameter group. = yields the identity transformation If 

j*j = Ji cos 0i //i sin 0i 
t/2 2:1 sin 0i + Vi ( ' os 0i> then 

x 2 = x cos (0 + 00 .v sin (0 + 0i ) Hi = x sin (0, -f 2 ) -f- /y cos (0i + 2 ) 

The parameter is additive for two successive transformations 

A third example is the group x = ax, i/i a z y. The identity transformation occurs 
for a s 1. The parameter is multiplicative for two successive transformations. 

Infinitesimal transformations can be found as follows: Let to be that 
value of the parameter t which yields tho identity transformation. Then 

^-M//,/) (8>20) 

y = <p(x, /y; /o) 
Tf (8.19) is continuous in , a small change in / will yield a transformation 



If / and (p are differentiable in a neighborhood of t = t Q , one has 

8x = (0 = ^ x ' ?^ ^ 
" 
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differing from (8.20) by a small amount. The transformation 

*i-/(*,y;fe + ) 
yi = v(x, y\t* + e) 

is said to be an infinitesimal in the sense that x\ and y\ differ from x and y, 
respectively, by small amounts when e is sufficiently small. Subtracting 
(8.20) from (8.21) yields 

5x = Xi - x = /(;r, y; U + e) f(x, ?/; / ) 

by = //i - y = *>0r, //; o + e) 



(8.22) 



except for infinitesimals of higher order, e = 5t. 

Example 8.12. For the rotation group of Example 8.11 one has ( -^ ) = y, 

\o6/ 0=.o 

(^ J = x, so that (8.22) becomes 
oU/ O-o 

8x = y bt by x bt 

Let us consider the change in the value of a function F(x, y) when the 
point (#, y) undergoes an infinitesimal transformation. One has 

dF dF 

8F = F( Xl , yi ) - F(x, y) = ~ 3x + ^ Sy 

except for infinitesimals of higher order. From (8.22) one obtains 

""Os + 'D" 23 > 

The operator U defined by 



is very useful. Equation (8.23) can be written as dF = (UF) dt. If 
t = corresponds to the identity transformation, we can replace 6 by 2, 
remembering that t is small. 

It is interesting to note that if one is given the system of differential 
equations 

= x 

(8.25) 
= y 
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then the solution of (8.25), written 

xi = /(z, 2/;0 

2/i = <f>(x,y;t) 



317 



(8.26) 



is a one-parameter group additive in t. To show this, we first note that 
(8.26) yields a curve starting at the fixed point (x, y). As t varies, Zi 
2/i vary (see Fig. 8.1). 

Now let us consider the system 



dy* 



2, 2/2) #2 = Xi at ti = t 

/2) 2/2 = 2/i at /i = t 

If we let tz = ti t, t fixed, (8.27) becomes 

,1*^ 

(2) Xz = x\ at tz = 

^2) 2/2 = 2/i at tz = 



(8.27) 



ctt 2 

^2/2 



(8.28) 



It i> s obvious that (8.29) yields the 



Since (8.28) has exactly the same form and initial conditions as (8.25), 
of necessity, 

2 = f(xi, 2/1; t z ) = /Cri, 2/1 ; 'i - ,g 29) 

At t ti we have x% = x\, 2/2 = 2/i- 
same curve as given in Fig. 8.1, and 
for 1 > t we obtain an extension of 
T to the point (# 2 , 2/2). Thus the 
product of two transformations be- 
longs to the group, and the parame- 
ter t is additive. The establishment 
of the inverse transformation is left 
to the reader. 

Let us now begin with a group 
transformation given by (8.19) and 
attempt to establish a system of 
differential equations involving x\, 
2/i, and t. From the group property 
we have 

22 =/(x, y,p(t, ti) 
2/2 = <P(X, y 1/3(1, ti 




P l (x l9 yj at t 



o 



= f(xi, 2/1 ; h 

= <P(XI, 7/1; 



FIG. 8.1 



(8.30) 



when Xi and 2/1 are replaced by the values as given by (8.19). We choose 
t = as that value of the parameter which yields the identity element. 
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Of necessity, 0(t, 0) = t, since /(zi, y^ 0) = Xi = f(x, y\ t). We differ- 
entiate (8.30) with respect to h, keeping x, y, and t fixed. Thus 



Evaluating at t\ = yields 



dt 
d<p(x, y,t) 

dt \dtj tl 

From (8.19), ^ = feliJl, ! 
at dt 



j, so that 



(8.31) 



where X(<) = (^ ) If t is additive, we have &(t, ti) = t + 

LVflii/i.-oJ 

X(0 = 1 and (8.31) becomes (8.25). In any case 



0&i, 2/0 i?C&i, 2/i) 

so that the change of variable, u = J\(/) dZ, reduces (8.32) to (8.25). 
Thus one can always find a new parameter such that the group transfor- 
mation is additive relative to the parameter. 

Example 8.13. We consider -~ xj, -~ = 1, with xi ** x,yi ** y &tt = Q. The 

solution is Xi x/(l xt), y\ =<"+?/. Let the reader show that an additive group 
has been obtained. 

Let us now consider an arbitrary function of x and i/, say, F(x, y). 
The value of F(x\, y\) for t small can be obtained as follows: Since 
^(zi, 2/i) = F(f(x, y, t), <p(x, y, t)), we expand F in a Taylor series about 
t = 0, assuming that this is possible. Thus 



F(XI, 



I 



dF(xi, 2/1) 



d^ 
*i 7,9 



(8.33) 



Now F(xi, y t ) 



, y). Also 
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dF(xi, 



dt 



[ 



i dt dyi dt 
dF dF 



provided the group transformation is additive. Moreover 



dt 2 



From 



one easily shows that 



t-o 



= U 2 F(x, y). By mathematical 



induction it can be shown that 
dF(x l9 yi 



dt n 



= UF(x, y) 



Equation (8.33) can be written in the form 

F(x l9 yi) = F(x, y) + (UF)t + (U*F) - 
For the special case F(XI, yi) = x\ we have 

%i(x, y> f) = x + (Ux)t + (U 2 x) ^ 
Example 8 14. For (x, y) y, ij(x, y) x we have 



(8.34) 



-2, + x ~ Ux = -y U*x - -x U*x ** y 



*U 4. 

so that 



, 
21 +4! 



\ /^, i 3 , < 6 \ 

)-H'~3! + 5! ---- ) 



x cos t y sin t 
Similarly y\ = x sin t -}- y cos /, and the rotation group is obtained. 
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Problems 

1. Show that the Einstein-Lorentz transformations form a one-parameter group 



- _ t - V/c 2 x 
~ 



Note that the parameter V is not additive. 

2. Find the transformation group obtained by integrating 



3. If Sl(x, y) is an invariant under a group transformation ll(x, y) = to(xi, y\\ show 
that Vil(x, y) = 0. 

4. Solve Ull(x, y) for the rotation group, and show that x z -f y z is an invariant 
for this group. 

5. Given (x, y) = y, r)(x, y) = x, find the transformation group. 

6. The differential equation -j- = / ( ~ ) remains invariant under the group trans- 

formation x txi, y = tyi, for all /, since - = and -~ = - - = ^ Let 

x Xi dx t dxi dxi 



D = In y = In i/i + In t = Vi 4- a, In t = a The equation -- = / f - j 
F I u, v, ~r~ j =0, and F ( u, v, ~ J is invariant under the translation 



becomes 



u = MI, v v\ + a 



Thus F must be independent of y, so that FI ( u, j- J =0 can be integrated by 



fly ^2 _1_ ^, 

separation of variables. Integrate -- = - -- 

^ dx x y 



8.10. Symmetric Functions. A function F(XI, x%, . . . , x n ) is a sym- 
metric fuiKition in x\, x^, . . . , o; n if any permutation on the subscripts 
1,2, ... ,n leaves / invariant. The function 



f(Xi, X 2 , #3) = ^10:20:3 + X\ + X\ + X 






is invariant under the symmetric group of order 6. If we expand the 
product (x Xi)(x xz) - (x Xn) y we obtain 

(X - Xi)(x Xz) ' ' ' (X X n ) 

= x n - <TiX n ~ l + o- 2 x n - 2 - <r 3 o: n - 8 + -lV 
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with (7i = xi 4- x 2 + x 3 + ' ' ' + x n 

(7 2 = XiX 2 + XiX 3 + ' + XiX n + X 2 X 3 + ' * ' + X n _lX n 

(7 3 = ZxtXjXk i ^ j ;* k 7* i (8.35) 



(7 n = XiX 2 X 3 ' X n 



The (7 t , i = 1, 2, . . . , n, of (8.35) are called the fundamental symmetric 
functions. It is apparent that any function of the o-'s is a symmetric 
function. A few examples lead us to suspect that the converse is also 
true. We note that/(x 3 , x 2 ) = x? + x\ can be written as 



(Xi + 0*2) 3 

with (7i = x\ + 2, ^2 = XiX 2 . The symmetric function 

/(Xi, X 2 , X 3 ) = xfx 



can be written as/(xi, x 2 , x 3 ) = (xiX 2 x 3 )(#i + x 2 + x*) = o- 3 <7i. 

THEOREM 8.12. The Fundamental Theorem of Symmetric Functions. 
If f(xi, x 2 , . - - , x n ) is a symmetric function (multinomial) in Xi, x<t, 
. . . , o: n , then /(si, x 2 , . . . , x w ) = g(<ri, o- 2 , . . . , <r n ), that is, / can be 
written as a multinomial in the fundamental symmetric functions <7i, 

0*2, - - - , o'n- 

The proof of the theorem is by mathematical induction. The theorem 
is certainly true for a function of one variable since f(xi) = /(0"i). Let 
us assume the theorem true for a function of n 1 variables. Thus if 
f(xi, x z , . . . , Zn-i) is symmetric, then / = g(<r lt (7 2 , . . . , a n ~\). Now 
let f(xi, X2, . . . , Xn] be any symmetric multinomial. Then f(xi, x^ 
. . . , x n -i, 0) is a symmetric function in Xi, 0^2, . . . , x n -i ; so by assump- 
tion we can write /(xi, x 2 , . . . , x n -i, 0) = g((ffi) Q , (cr 2 )o, . . . , (o- n -i)o), 
where vi = Xi + x 2 + - - - + x n , ((n)o = xi + x 2 + + x n _! + 0, 
etc., that is, (0-^)0 is the value of <r, evaluated at x n = 0, for j = 1, 2, 
. . . , n. Now if f(xi, x^ . . . , x n ) is a multinomial of degree 1, that is, 
/ == a(xi + x<z + - - + x n ), then the theorem is certainly true. We 
assume the theorem true for any symmetric multinomial of degree < k. 
This is a double mathematical induction type of proof. Now we con- 

sider h(Xi, X 2 , . . . , X n ) 3= /(Xi, X 2 , . . . , X n ) g(ffi, 0- 2 , . . . , <T n _i). 

It is obvious that h is also symmetric. Now 

h(Xi, X 2 , . . . , Xn-l, 0) = /(Xi, X 2 , . . . , X n _i, 0) 

. , (cTn-l)o) = 



so that x n = is a zero of A(XI, x 2 , . . . , x n ), which implies that x n is a 
factor of h. Since h is symmetric, of necessity Xi, x 2 , . . . , x n _i are also 
factors of h. Thus 

h(Xi, X 2 , . . . , X n ) = XiX 2 ' ' X n 8(Xi, X 2 , . . . , X n ) = ff n 8 
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where s is a symmetric function, and 

l, #2, , X n ) = g((Ti, (T 2 , . . . , <T n -l) + ff n s(Xi t Z 2 , . . . , X H ) 



The degree of s is k n < k, so that, by the induction hypothesis, 
s(xi, J*2, . . . , x n ) can be expressed in terms of the fundamental symmetric 
functions, yielding 



2, . , #n = 00" 1, 0*2, , OVi-l + <r n /(Ti, <T2, . . . , <T n 

This concludes the proof of the fundamental theorem of symmetric 
functions. 

Example 8 15. We consider 

/(a?,, x 2 , *,) - r? + a?i + s? -f 2ar,aj s + 2x,r 8 -f 2r 2 o- 3 - ^\x\x. - 3.r?r>2 - 
With patience one can show that / is symmetric. Let us consider 

f(xi, x 2 , 0) == x\ + *l + 2xix t = (ar, + xtf = [(r,) ] 2 
The function f(zi, X2, xt) a\ should have the factor ^1X2X3. We note that 

/ o\ = 3x 1X2X3(2; \xi 4- J*i^3 + 0*2X8) = 3<r2<rji 
so that / = <rf 3o- 2 o- 3 . 

Problems 

1. Show that 

(1 -f j-?)(l -f xj)(l + arj) = 1 + <r? - 2<r 2 + ^ - 2cr,r 8 + o- 3 2 

2. Let , /3, 7 be the zeros of f(x) x z -\~ px z + <?x H- ? Show that 

a l+JH 4. 2 + 7 2 , P* 4- T 2 = V - 2p 2 <y -f 4pr 
~-~ ~~ 



7 pq - r 

3. Referring to Prob. 2, show that 

2 /3 2 + <*V + )3 2 7 2 = q* - 2pr 

4. Consider 

Sl = Xi + X 2 -f -f X n 



= X\ 



X\ 






Express a* in terms of Si, 82, s 3 . 
8.11. Polynomials. We shall be interested in polynomials of the type 

n 

p(x) = Y c k x k = Co + Cix + dx* + ' ' ' + c n x n (8.36) 

fc-O 

The coefficients c t , i = 0, 1, 2, . . . , n, are assumed to belong to a field F. 
The reader is referred to Sec. 4.1 for the properties of a field. If a set 
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of elements belong to a field, we can perform the simple operations of 
addition, multiplication, and their inverses on these elements. The 
zero element and unit element belong to a field and the distributive laws, 
a(b + c) = ab + ac, (6 + c)a = ba + ca, hold. The complex numbers 
form a field. A subfield of the field of complex numbers is the field of 
real numbers. The rational numbers form a subfield of the field of 
real numbers. Let us consider the set of real numbers of the form 
a + b \/2> where a and b are rational numbers. The sum and product 
of two such numbers is evidently a number of this set. The unit element 
is 1 = 1 + \/2, and the zero element is = + -\/2. We must 
show that every nonzero number has a unique inverse. From the fact 
that \/2 is irrational we have a + b \/2 = if and only if a = b = 0. 
Assume a + b \/2 7^ 0. Then it is easy to show that [a/a 2 26 2 ] + 
[( 6) /a 2 2fr 2 ] \/2 is the unique inverse of a + b -\/2. Let the reader 
solve 

(a + b V2)(rc + y A/2) = 1 + \/2 

for x and ?/, a 2 2b* ^ 0. We call this field an algebraic extension of 
the field of rationals, written R(\/2). We shall discuss algebraic exten- 
sions of a field in the next section. 

If f(x) and g(x) are polynomials with coefficients in a field F, we say 
that g(x) divides f(x) if a polynomial h(x) with coefficients in F exists 
such that f(x) = g(x)h(x). An important theorem concerning poly- 
nomials is as follows: Given two polynomials with coefficients in F, say, 
f(x) of degree m, g(x) of degree n, there exist two polynomials with coeffi- 
cients in F, r(x) of degree < n, s(x) of degree < m, such that 

r(x)f(x) + s(x)g(x) = d(x) (8.37) 

where d(x) is the greatest common divisor of f(x) and g(x). All other 
divisors of both f(x) and g(x) are divisors of d(x). The coefficients of 
d(x) are in F. 

Proof. We consider the set of all polynomials of the form a(x)f(x) + 
b(x)g(x) t with a(x) and b(x) arbitrary. These polynomials have degrees 
(exponent of highest power of x) greater than or equal to zero. A con- 
stant is a polynomial of degree zero. Let a(x) = r(x), b(x) = s(x) be 
those polynomials for which the degree of a(x)f(x) + b(x)g(x) is a min- 
imum, but not identically zero, and let 

d(x) = r(x)f(x) + s(x)g(x) 

Any divisor of both/(x) and g(x) is obviously a divisor of d(x). We show 
now that d(x) divides both /(x) and g(x). If d(x) does not divide 
then by division (always possible since the coefficients are in a field) 

f(x) = q(x)d(x) + t(x) ^ degree of t(x) < degree of d(x) 
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d(x) ^ 0. Hence qd = qrf + qsg = / t, and 

(8.38) 



From the definition of d(x), (8.38) is impossible unless t(x) = 0, so that 
d(x) divides /(x), written d(x)/f(x). Similarly d(x)/g(x). Let the reader 
show that degree of r(x) < degree of g(x), degree of s(x) < degree off(x). 

Example 8.16. Let f(x) = x 2 + 1, g(x) = x. Then 1 (x z + 1) + (~x)x = 1, so 
that 1 is the greatest common divisor of f(x) and g(x). We say that /Or) and g(x) are 
relatively prime. 

Problems 

1. Show that the integers do not form a field. 

2. Show that the set of numbers x + ly -%/3> x an( l y ranging over all rational 
numbers, forms a field 

3. Show that x + 1 is the greatest common divisor of f(x) = x z -f 1, g(x) = (x + I) 2 . 
Find r(x) and s(x) such that r(x)f(x) + s(x)g(x} = x + I. Are r(x] and s(x) unique ? 

4. Let f(x) and 0(:r) be polynomials with coefficients in a field Fi, with F\ a sub field 
of the field F. If f(x) and r;O) are relatively prime with respect to the fipld /<\ show 
that f(x] and g(x) are relatively prime relative to the field F. How does this apply 
to Prob. 3? 

6. Do the set of numbers a -j- b \/2 + c \/3, a, 6, c rational, form a field? What of 
a + 6 V2 + c V3 + d >/6 ? 

6. Let F be any field. If we label the zero and unit elements and 1, respectively, 
and define 1+1 =2, etc., show that every field contains the rationals as a subfield. 

8.12. The Algebraic Extension of a Field F. Let p(x) = 

polynomial with coefficients in a field F. We say that p(x) is irreducible 
in F if p(x) cannot be written as a product (nontrivial) of two polyno- 
mials with coefficients in F. We say that p is a zero of p(x) or a root of 
p(x) =0 if p(p) = 0. Kronecker has shown that one can always extend 
the field F in such a fashion that a new number p is introduced, yielding 
p(p) = 0. The solution of x 2 + 1 = involves the invention of the 
number i = \/~~ ! 

THEOREM 8.13. Let p(x) be an irreducible polynomial in F such that 
p(p) = 0. If q(x) has coefficients in F, then q(p) = implies that 



Proof. Since p(x) is irreducible, p(x)/q(x), or p(x) and g(x) are rel- 
atively prime. From Sec. 8.11 

A(x)p(x) + B(x)q(x) 1 

for some A(x), B(x). Thus A(p)p(p) + B(p)q(p) =0 = 1, a contra- 
diction. Thus p(x) divides q(x). 

Two corollaries follow immediately from Theorem 8.13. 
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COROLLARY 1. p(x) is that irreducible polynomial of smallest degree 
such that p(p) = 0. 

COROLLARY 2. If we make the leading coefficient of p(x) unity, 
p(x) x n + c n -ix n ~ l + + Co, then p(x) is unique. 

DEFINITION 8.7. The n zeros of p(x), say, pi, p 2 , . . . , p n , are called 
conjugates of each other. The zeros of x 2 2, namely, \/2, \/2> arc 
conjugates of each other. It follows from Theorem 8.13 that any one of 
the n conjugates of p(x) determines p(x) uniquely, assuming that the 
leading coefficient of p(x) is unity. 

THEOREM 8.14. Let p be a zero of the irreducible polynomial p(x) 
with coefficients in a field F. The numbers 1 , p, p 2 , . . . , p n ~ l are lin- 
early independent relative to F, since if constants c, i = 0, 1, 2, . . . 

n - 1 n - 1 

n 1, exist in F such that ^ c^p* = 0, then t(x) = y c*a;* is a poly* 

k=o fc=o 

nomial in F of degree < n with f(p) = 0, a contradiction to Corollary 1. 
An important result is the following: Given the irreducible polynomial 
p(x) with coefficients in F such that p(p) = 0, one can obtain a field F\ 
containing p such that F is a subfield of F\. We call F\ an algebraic 
extension of F and write F\ = F(p). The elements of Fi(p) are of the 
form 



T 

I 



(8.39) 



with ^ 6 ; p 7 7^ 0, a t in F, b 3 in F. 

j=0 

Let the reader show that the elements of the type given by (8.39) form 
a field. By using the fact that p n + c n -ip n ~ l + ' ' ' + CQ = one has 

P H - ~ (Co + Cip + ' ' ' + Cn-lp 11 " 1 ) 



= - (Cop + Cip 2 + ' ' + C n -2p n ~ 1 ) 

+ C n _i(Co + Cip + ' ' ' + Cn-lp"- 1 ) 

so that powers of p higher than n 1 can always be reduced to lower- 
powers. For example, if p 3 + p + 1 =0, then 

P 3 = -(P + 1) P 4 = -P-P 2 

P 6 = -p 2 - p 3 = - P 2 + p + 1 etc. 

We show next that the elements a of (8.39) can actually be written as 
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8 

polynomials in p. The denominator of (8.39) is } b 3 p 3 . Let 

h(x) = 



y-o 

Since h(p) ^ 0, of necessity h(x) and p(x) are relatively prime. From 
Sec. 8.11 we have 

A(x)h(x) + B(x)p(x) = 1 

A(p)h(p) + B(p)p( P ) = 1 = A(p)h(p) 

so that the inverse of h(p) is A(p). Thus 

r n-1 

a(p) = A(p)~ 



t=0 t-0 

by using the fact that p(p) = to eliminate higher powers of p. 

Example 8.17. The field R(\/2) is obtained by considering x z - 2 = 0, p 2 - 2 * 0, 
p = \/2. The same field is obtained if we consider p = -\/2- This is not true for 
all cases. The coefficients of x z 2 belong to the field of rationale. The elements of 
/2(\/2) are of the type a + bp a -h b \/2, a and b rational. 

Next we consider the polynomial equation p(x) = x 2 3 = 0. It is easy to show 
that p(x] is irreducible in the field R(-\/2) since \/3 = a -f & \/2, a and 6 rational, is 
impossible. Let the reader show this. Now the coefficients of x* 3, namely, 1 and 
3, belong to R(\/2). Hence we obtain an algebraic extension of R(-\/2) by con- 
sidering the set of numbers of the type (a + b \/2) -f- (c 4- d \/2) \/3> a ) ^> c > d 
rational. This is the field /(\/2, \/3)> and its elements are of the form a -f- b \/2 -h 
c \/3 + d \/6, a, b, c, d rational. In a similar manner we could construct R(\/3 } \/2). 
It is evident that fl(\/2, V3) - J8(V3 V2) 

THEOREM 8.15. Let o: be any number of the algebraic extension F(p) 
obtained from p(p) = 0. Then a is the zero of a polynomial f(x) with 
coefficients in F. We say that f(x) = is the principal equation of x = a. 

Proof. Let the conjugates of p(x) = be p = pi, p 2 , p 3 , . . . , p n . 
Since a is in F(p), of necessity, 



a = ai = a 
We form the conjugates of i defined by 



= o + ip2 + a 2 pl + ' * * + an-ipS" 1 

= + flips + 2p3 + ' ' ' + fln-lpS" 1 



/o .QX 

Certainly f(x) s (re ai)(x x 2 ) (x n ) has i as a zero. We 
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need only show that the coefficients of f(x) are in F. Now 
f(x) = x n - 



Moreover 2ct,, S<x,a*, etc., are certainly symmetric in the p, i = i, 2, 
. . . , n, and so can be written as polynomials involving the fundamental 
symmetric functions, 

Pi + P2 4- " * 4- Pn 

Plp2 + Plp3 + ' * ' + Pn-lpn 



(8.41) 

PlP'2 ' ' ' pn 

The elements of (8.41) are simply the coefficients of p(x), except for some 
negative factors, as seen by expanding (x pi)(x p 2 ) (# p n ). 
Thus the coefficients of f(x) are in F. 

Example 8.18. Referring to Example 8.17, we consider a. = on = 3 4 \/2. 
Then 2 = 3 + 4 \/2, and 

f(x) - (x - ai)(x - 2 ) = (x - 3 4- 4 V2)(x - 3 - 4 -v/2) 
- (x - 3) 2 - 32 
= x 2 - 6.r - 23 

The coefficients of /(x) are in the field of rationals. 

Example 8.19. Let us find the principal equation of a 1 -f- p 2 , where p is a zero 
of p(x) = x* + 3x 4- 2. Let the zeros of p(x) be p = pi, p 2 , p 3 . Then 

f(x) = [x - (I + p*)][x - (1 + p 2 2 )Jlx - (1 -f P 3 2 )] 

= x 8 [3 + p 2 4- p 2 -f p 2 ]^ 2 + [(1 4- PI)(! -f p 2 ) + (1 4- p 2 )(l 4- p 2 ) 

From ai = pi 4" p2 4" PS = 

0"2 = P1P2 4- P2P3 + P3pl = 3 
<T3 = P1P2P3 = 2 

we have 

Pl + P2 4- P? = (Pl 4- P2 4- P3) 2 2(pip 2 4" P2P3 4- P3pl) 

-0 - 2-3 - -6 

Also (1 4- P?)(l + P!)(! + P?) - 1 + <rj - 2<r 2 4- a\ ~ 2o"io- 3 + rj 8 (see Prob. 1, 
Sec. 8.10). Moreover 

(1 -f P 2 )(l 4- P 2 2 ) 4- (1 + pf)(l -f- P 3 2 ) -f (1 4- pi)(l 4- P 3 2 ) 

==34- 2(pf 4- P 2 4- P 2 ) 4- (pip2 4- PIPS + psp2) 2 2pip 2 pa(pi -f p 2 4- PS) 
- 3 +2(-6) +9-0 

Hence f(x) = x 3 4- 3x 2 - 8. We check that 1 + p 2 satisfies 

(1 4- P 2 ) 3 4- 3(1 4- P 2 ) 2 -8 = 
We have 

/(I 4- P 2 ) - P 6 -f 6 P 4 4- 9p 2 - 4 

- (-2 - 3p) 2 + 6 P (-2 - 3p) + 9p 2 - 4 - 

since p 8 4- 3p 4- 2 0, or p 8 = -2 - 3p. 
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Problems 

1. Show that x s -f x + 1 =0 has no rational roots. Hint: Assume x = m/n is a 
zero of p(x) = x s + x -f- 1, m and n integers m lowest form Show that of necessity 
m 1, n = 1. By testing x = 1 show that p(x) has no rational zeros Why 
is p(x) irreducible in the field of rationals? 

2. The polynomial p(x] x 4 -f x 3 + 2x 2 -f z -f- 1 has no rational zeros. How- 
ever, p(x) is reducible m the field of rationals since 



Is this a contradiction ? 

3. Let p be a zero of p(x) x 3 -f x -f 1 (see Prob. 1). Express 1 + p + P 2 /1 p 
+ p 2 as a second-degree polynomial in p 

4. Let /(x) = ao -h i + + flnZ n , a, t = 0, 1 , 2, . . . , n, integers If a 
prime number p exists such that p does not divide a n , p divides o l for i < r?, /> 2 does 
not divide c?o, then /(x) is irreducible in the field of rationals. This criterion is due to 
Eisenstein. Show that x n p is irreducible in the field of rationals, p a prime. Show 
that f(x -f 1) = x* + 2x + 2 = (x + I) 2 + 1 is irreducible m the field of rationals. 
Why can one immediately make the same statement for f(x) x~ -f- 1 ? 

6. Consider p(x) = x 4 -f- 3x 2 -f- 9. Let the zeros of p(x) be p, p,<r, <r Show 
that the principal equation of a = 1 -f Op + p 3 is/(.c) = [(a; I) 2 81 ] 2 . 

6. Consider x 2 - 2 = ; z 3 - 2 = 0, and obtain the field R(\^2, \/2). 

7. A polynomial is said to be separable if it has no multiple zeros Show that if 
p(x) is irreducible it is separable. 

8. Consider f(x) = (x - ai)(x - 2 ) (j - ), the , defined by (8 40) If 
f(x) is irreducible, then f(x) = [/(a:)] 1 . If /(x) is reduci])lc in F, then 



Assume f\(x) is a function for which /i(i) = 0, so that f\(a\(x}} is zeio for x = pi. 
Why does p(x} /J\(ct\(x})1 Hence explain why f(ai(x)) is zero for j" = pi, P2, . . . , p n , 
so that f(a\) = f(a 2 ) =s - . . r= f(a n ) = 0. If the a, are not distinct, show that 
/W = [/i (x)l k , otherwise f(x) = fi(x). 

8.13. The Galois Resolvent. Let f(x) be a principal equation of 
x = a. We say that a is a primitive number of F(p) if f(x) is irreducible 
in F. 

THEOREM 8.16. If a = i is a primitive number of F(p), then 

F (a) = F(p) 

that is, every number of F(#) belongs to F(p), and conversely. 

Proof. Let f(x) (x a\)(x at) - (x a n ) be the principal 
equation of a = a\. Define <p(x) by 

^-^Vx-a.'x-a, -., 

The p, i = 1, 2, . . . , n, are the conjugate zeros of p(x). Then 

Since f(x) is assumed irreducible, /'(ai) ?^ (see Prob. 6, Sec. 8.12). 
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Thus pi = <f>(ai)/f'(ai), so that pi is a number in F(cti). Any number of 
F(pi) thus belongs to F(ai). Why? Since i is a polynomial in pi, it 
follows that any number in F(ai) is a number in F(pi). Q.E.D. 

THEOREM 8.17. Let pi be a zero of pi(x), a\ a zero of pz(x), Pi(x) and 
pz(x) irreducible in F. We can form the fields JP(pi), F(cri). If pz(x) 
is reducible in F(pi), we consider the irreducible factor of pz(x) having 
(TI as a zero. In this way we can form F(pi, a\) (see Example 8.17). 
Let the reader show that F(pi, a\) = F(cri, pi). We show that an irreduc- 
ible polynomial in F exists yielding a field F(r) such that F(T) = F(pi, <TI). 

Proof. Let 

TI = api + jSdi 

T2 = Oipi + /3CT 2 



with a and in F. The p,, t = 1, 2, . . . , m, are the conjugates of 
Pi(x) = 0, and the o-,, t = 1, 2, . . . , n, are the conjugates of ^(x) = 0. 
We (;hoose a and so that the r,, j = 1, 2, . . . , ran, are all distinct. 
This can be done since, if ap l + fa y = ap k + fa t , then 

,- ^ fc (8.42) 



P P^ Pk 

There are only a finite number of ratios in (8.42), so that a and ft can be 
chosen such that (8.42) fails to hold for all i p^ A:, j ^ I. For i = k we 
have faj 7^ fa i if <r 3 7^ (TI. For j = I we have ap, ^ ap^ for i 7^ k. Next 
we form 



g(x) = (x n)(x - r 2 ) (x r mn ) 

= x wn - (SrOz- 1 + + (-I) mn rir 2 T mn (8.43) 

Any interchange of two p's leaves the coefficients of g(x) invariant, as 
does any interchange of two o-'s. Thus the coefficients of g(x) are in F. 
That factor of g(x) irreducible in F having TI as a zero generates F(TI) 
with TI = api + fai- Since TI = api + fai, F(TI) is a subfield ofF(pi, (TI). 
We now show that any number ry of jF(pi, 0*1) belongs to F(n) so that 
F(n) = F(pi, o-i). If r; is in F(pi, o-i), T? is of the form 



(o 

+ (60 + 6ipi + + bm-ipf' 1 )^ 

+ ' ' + (C + Ctpi + ' + Cm-ipf- 1 )^?- 1 



n 1 m 1 



;-=0 i=0 
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with the d l} in F. The mn 1 other conjugates of p\ can be formed by 
replacing p\ by p 2 , p 3 , . . . , p m and <r\ by cr 2 , 0-3, . . . , <r n in all possible 
ways. We define (p(x) by 

(8.44) 



Let the reader show that 771 = <f>(ri)/g'(Ti), g'(ri) 7* 0, since g(x) has 
no multiple roots. Hence 771 belongs to F(r\). Q.E.D. 

Example 8.20. _ We consider x* _- 2 =_(), r 2 - 3 = with_pi = \/2, <TI = \/3- 
We form n = \/2 + \/3, r 2 = \/2 - \/3, r 3 = - V2 + \/3, T 4 = - \/2 - \/3- 
No two of the T'S are equal. Then 

0(x) - (x - n)(x - n)(x - TI)(X - T 4 ) = x 4 - lOo- 2 4- 1 

The coefficients of g(x) are in the field of rationale. It can be easily shown that g(x} 
is irreducible in F. Thus g(x) generates F(\/2 -f* \/3)> which has elements of 
the form 

a + 6(\/2 -f \/3) + c(A/2 -f A/3) 2 + d(\/2 -f V3) 3 = x + ?y V2 -h ?* \/3 -f w\/6 
j, y, u, v rational. Thus F(\/2 + \/3) = /^( V2, \/3) (see Example 8.16). 

A special case of Theorem 8.17 occurs if pi(#) = p%(x) and p\ ?* a\ = P2- 
The field F(p\, p^) can be generated. Continuing, we can adjoin to F all 
the roots of p(x) = 0, obtaining F(pi, p 2 , . . . , p n ). A single number r 
exists such that F(r) = F(pi, p 2 , . . . , p n ). The irreducible polynomial 
having r as a zero is called the Galois resolvent of p(x) 0. It is a poly- 
nomial of degree ^ n!, since we need only find constants {a*} such that 



n 

7 



and the other T'S obtained from the n\ 1 permutations of 



the p^s are different from each other. The rest of the proof proceeds as 
in Theorem 8.17. 

DEFINITION 8.8. If pi, p2, . . . , p n are the zeros of the irreducible 
polynomial p(x) such that 

F( Pl ) = Ffa) = = F( Pn ) 



then p(x) or p(x) =0 is said to be normal and F(p) is called a normal 
extension of F. In this case F(pi) = F(pi, p 2 , . . . , p n ) and p(x) is its 
own Galois resolvent. 

Example 8.21. Consider p(x) x 8 3x -f 1 =0. Let p be a zero of p(x). By 
division x 8 3x -f 1 = (x p)(x 2 -f px -f p 2 3). The other two roots of 
p(x) - are pi, 2 - (-1 Vl2 - 3p 2 )/2. We show that 12 - 3/> 2 is a perfect 
square in F(p). Assume 12 3p 2 = (a + bp -H cp 2 ) 2 , a, 6, c rational. Using the fact 
that p 8 3p - 1, p 4 = 3p 2 - p, we have 12 = a 2 - 26c, = -c 2 -|- 2ab + 66c, 
3 = 52 -f- 3c 2 -f" 2ac. It is easy to check that a = 4, b I, c = 2 is a rational 
solution so that pi = 2 p p 2 , p 2 p 2 2, which implies F(p) = F(pi) F(pt), 
and p(x) is normal. 
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Problems 

1. Consider p\(x) = x* 2 = 0, p 2 (x) = x 4 2 = 0, with pi(rc), p z (x) irreducible 
in the field R of rationals. Find 7?(\/2, v^) 

2. Find a Galois resolvent for p(x} = x 3 + 2x + 1. 

3. Show that p(a*) = j 8 3(r 2 + 3s 2 ) j* + 2r(r* -j- 3s 2 ) is normal for r and s integers 
provided p(x) is irreducible in the field of rationals. 

4. The discriminant of p(x} = x 3 + px + 9 is A = 4p 3 27</ 2 . Show that the 
discriminant of p(x) of Prob. 3 is a perfect square. 

5. Show that 171 *>(ri)/0'(n), ?(z) defined by (8.44). 



8.14. Automorphisms. The Galois Group. Let us consider the field 
R(\/2) composed of all elements of the form a + b \/2, a and b rational. 
We consider the one-to-one correspondence between a + b \/2 and its 
conjugate, a b \/2, written a + b \/2 <-> a 6 \/2. Let us look 
more closely at this correspondence. If 

x + y \/2 *-+ x y \/2 
u + v \/2 <-> u v \/2 
then 

(x + y v/2) + (u + v \/2) = (x + u) + (y + v) \/2 

<-* (x + u) - (y + v) \/2 

= (x - y -v/2) + (7/ - r \/2) 
(x + y \/2)(u + v -x/2) = (xu + 2yv) + (xv + yu) \/2 

-> (xw + 2yv) (xv + yu) \/2 

= (x - yV2)(u - v -s/2) 

A one-to-one mapping of a field into itself, written a <-> ', or a! = /(a), 
a = f~ l (a f ), such that a <^ a', <- /3', imply a + ft <-> a' + 0', a/3 <~> 
a'0', for all a and of the field, is called an automorphism of the field. 
From above we see that the mapping a + b \/2 <-> a b -\/Z is an auto- 
morphism of R(\/2). Every field has at least one automorphism 
attached to it, the identity mapping, <-, for all a of the field. Not 
all one-to-one mappings of a field into itself yield an automorphism. 
Witness the mapping x <- 2x, with x ranging over the field of real num- 
bers. For y <- 2y we have xy <- 2(xy) ^ (2x)(2y). Under an automor- 
phism the zero element maps into itself, as does the unit element, for 
a <-> a', <-> x, imply a + = a <-> a' + x = a', so that x = 0, and 
a <-> a', 1 <-> y, imply a - 1 = a *-* a'y = a', so that t/ = 1. We say that 
and 1 are left invariant under any automorphism. Let the reader show 
that the rationals of a field F remain invariant for all automorphisms of 
F. It is also a simple matter to show that the automorphisms of a field 
form a group. If a = /i(/3), = f z (y) are automorphisms AI and A^ 
we define A^Ai as the automorphism a = fi(fz(y)) = /S(T). Let the 
reader show that under this definition of multiplication the set of auto- 
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morphisms of F form a group. The unit element of the group is the 
identity automorphism. 

Let us return to the irreducible polynomial p(x) with coefficients in F. 
We shall be interested in the extension field N = F(pi, p 2 , . . . , p n ) with 
p t , i = 1, 2, . . . , ft, such that p(p l ] = 0. We consider only those auto- 
morphisms of F(pi, p 2 , . . . , p n ) which leave the elements of F invariant. 
The identity automorphism is one of this type. Let the reader show that 
the set of all automorphisms of F(pi, P2, . . , pn) leaving the elements 
of F invariant form a group. 

DEFINITION 8.9. The group G of automorphisms of F(pi, p2, . . . , p) 
leaving F invariant is called the Galois group of p(x) = 0, or the Galois 
group of F(pij p 2 , . . . , p n ) relative to F. 

THEOREM 8.18. If p(x) is normal (see rfec. 8.13), the Galois group of 
p(x) contains exactly n automorphisms. 

Proof. First of all we have 

P2 , . . . , Pn) = F( Pl ) = F( P z) = ' = f (p n ) 



^^ 

since p(j) is assumed normal. Jf p(x] N a^' fc , then a* <- A- for any 

fc = 

automorphism of the Galois group G. Let pi correspond to a for any 

TJ ri 

automorphism of G. Then = N akp\ <-> y a^ = 0, so that cr must 

fc=0 Jfc=0 

be a zero of q(x). Thus (pi <-> pi), (pj <~^P2J, . . . , (PI <-> p n ) are the 
only possible elements of 6 Y . We must remember that (pi *-* p t ) com- 
pletely determines an automorphism of G since any element of F(pi, p 2 , 

n-l 

. . . , p n ) is of the form N b^pj, with ?>*, fc = 0, 1, 2, . . . , n 1 in F. 

k*=0 

If p(#) were not normal, we could not make this statement. If p(x) is 
not normal, the order of G is less than or equal to n\ Why? Finally, 
we shall wish to use the faet that the Galois group G can be made iso- 
morphic to a regular permutation group (see Theorem 8.4). 

THEOREM 8.19. Let p(x) be normal. If a is any element of F(p = pi) 
which remains invariant under all automorphisms of (?, then a is an ele- 
ment of F. 

Proof. We know that all elements of F remain invariant under G. 
We wish to prove the converse if p(x) is normal. Let a = a\ be an ele- 
ment of F(pi), and consider its conjugates. Then 

ai = b Q 
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and f(x) = (x ai)(x az) ' ' ' (x a n ) is the principal equation of 
a = a\ (see Theorem 8.15), and the coefficients of f(x) are in F. But 
i = a 2 = * = oi n since a\ is left invariant under G. Thus 

f(x) (x oi) n = x n nax n ~ 1 + + ( l) n a n 

so that na is in F, which implies that a is in F. Q.E.D. 

DEFINITION 8.10. If the Galois group is cyclic, the equation p(x) = 
is said to be cyclic. 

7? 

THEOKEM 8.20. If p(x) ^ dkX k is cyclic and normal, we can 

solve for the roots of p(x) =0 by a finite number of rational operations 
(addition, multiplication, etc.) and root extractions on elements of F(w), 
where w n = 1, w ^ 1, arg w 2ir/n. The degree of p(x) is n, and the 
coefficients of p(x] are in F. It can also be shown that F(w) can be 
generated by a finite number of rational operations and root extractions 
on the elements of F. We omit proof of this latter statement. 

Proof. Since p(x) is normal, G is composed of exactly n elements. 
Moreover G is assumed to be cyclic, so that a single element generates G. 

We represent this element by the permutation P = ( ^ . . J. Let 

us consider the set of elements 

y III _L Q>n-l 

1 = PI + P2 4- Pi + -r Pn = 

'2 = Pl + Wp<> + W"pt + + W"~ l p n /0 Ar ^ 

fc - p, + ,'p, + -V:< + " + -'>p. (8 ' 45) 



The t , i = 1, 2, . . . , n, certainly belong to F(p, w), p = pi. Now 
P(f 2 ) = P2 + wpt + + w n ~ l pi = 2/1^, considering w as a parameter 
in 2 . Thus [P( 2 )] ri = 2 since w n = 1. But [P( 2 )] n = P(?) since 
raising 2 to the nth power and then permuting the indices 1,2, . . . , n 
is equivalent to first permuting the p's and then raising the new entity 
to the Tith power. Thus P( 2 ) = 2 so that ^ is invariant under P of G. 
From P 2 (?) = P[P(2)1 = P(?) = 2, etc., we see that is invariant 
for all elements of G. The same result applies to 7, 5, J, . . . , 
Now 

2 = tto(pl, P2, > Pn) 

+ Ol(pl, P2, , Pn)W + ' ' ' + a n _i(pi, p 2 , . . . , Pn)^"" 1 

by direct expansion of 2 , since iy n = 1. We wish to show that a , 
01, . . . , a w _i are in F so that 2 is in /^(ty). Under any permutation 
of G we have a <-> aj, ai <-> aj, a 2 <-^ a 2 , . . . , a n _i <-> a^. Since ^ is 
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invariant under (?, we have 



+ a n ~\w n ~ 

= aj + a[w + aX + + a'^w*- 1 (8.46) 

for all w such that w n = 1. Now the equation 

q(x) =b + b,x + - + kn-i^- 1 = 

has exactly n 1 roots. Equation (8.46) is of this type yet has n dis- 
tinct zeros, namely, 1, w, w 2 , . . . , w n ~ l . Thus (8.46) can only be true 
for all w satisfying w n = 1 if a = aj, ai = a[, . . . , a n -i = a^-i. Hence 
Oo, ai, . . . , a n _i are invariant under (r, so belong to F by Theorem 8.19. 
Hence 2 is in F(w). The same result applies to ?, 3, , & The 
t , i = 1, 2, . . . , n, of (8.45) are nth roots of elements inF(w), w = e 2?ri/n . 
We now look upon (8.45) as a linear system of equations in the 
unknowns pi, p 2 , . . . , p n . This system has a unique solution provided 
the Vandermonde determinant 



D(w) = 



I 



1 



1 



w 
w 2 



w n 



w 4 



(8.47) 



is different from zero. Let the reader show that 

n-2 

D(w) = H (w t - w>) T* 0, j > i 



Hence the p t , i = 1, 2, . . . , n, can be expressed as linear functions of the 
with coefficients in F(w). Q.E.D. 



Example 8.22. In Example 8.21 we saw that p(x) = x 8 3x + 1 = is normal. 
It is also cyclic since its Galois group is of order 3. Let the roots of p(x) =0 be 
PI, PI, pa. Then 

r = PI 4- pa H- PS = 
f 2 PI -h WPJ + W*PJ 

^ = Pl + f 2 p2 + ?P3 



with w 3 = 1, w; 5^ 1, w 2 + w -f 1 =0. We must find 
that pipa -f P2P3 -h pspi == 3, pip2p.< = 1. Now 

pi 4- (W + W' 2 )(P1P2 -f P2p.i + Pa 
P.O 2 -f (P1P2 + P'2p.{ + P3pl)(? - 



(8.48) 
using the additional fact 



= P? 
== (pi 

= 9 



P2 



P2 



?# 2 2) 



g + ^3 - 2(p'] + P 2 4- Pj) - 3(P!P2 + P2P3 -f P'PI -f PIP. -f P 

= 2(p] + p2 -h p-j) 3 0(pi -j- p 2 -f ps)(pip2 + P2pt 4- 
= -27 

~ ^) 2 = (& 4- ^) 2 - 4gg - (-27)2 _ 4(9)3 = _ 2)18 7 
8 2 -^ -27 V3* 



+ P?P 8 ) + 

i) 4- 27pip2ps 
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Hence g = -27 y-' 6 = -27 lL-.. Note that gg - 9. The 

cube roots of |, 3 must be chosen so that 2 s = 9. Note that l and j belong to 
R(w) = J?(\/3 t). It is now a simple matter to solve (8.48) for pi, p 2 , pa, which we 
omit. 

Before concluding this chapter let us consider the following: We con- 
sider p(x) = x 4 + 9, irreducible in the field of rationals. Let the roots 
of x 4 + 9 = be p, p, (7, cr. From p( p)<r( <r) = 9 we see that 
o- = 3/p = 3p 3 /p 4 = Tip 3 - Hence p(x) is normal. The automor- 
phisms comprising the Galois group of p(x) are (p<-p), (p<- p), 
(p <-><r), (p <-> o-), and we can represent (7 by the permutation group 

_/1234\ /1234\ _/1234\ _/1234\ 

Pl ~ V1234/ 2 ~\2143/ P3 ~V3412/ P *" \432iy 

with pi = p, p 2 = p, PS = 0", P4 = o". This group is not cyclic. The 
subgroup Gi = (Pi, A) leaves invariant a certain number of elements of 
F(p). Let us see which of these elements are invariant under G\. PI 
leaves all elements invariant. If a = a + bp + cp 2 + dp 8 is left invari- 
ant under P%, of necessity 

a + bp + cp* + dp 3 = a + fe(-p) + c(-p) 2 + d(-p) 8 

which implies that 6 = d 0. Hence the elements a + 6p 2 are left 
invariant under G\. It is easy to show that these elements form a sub- 
field F(p 2 ) of F(p). Gi is called the Galois group of F(p) relative 
Note that F(p) D F(p 2 ) 3 F. 

The principal equation of a + 6p 2 is 



= [x - (a + 6p 2 )][x - (a + 6 P 2 )][x - (a + ba*)][x - (a + 6cr 2 )] 
= [(x - a) 2 - 6(p 2 + * 2 )(x - a) + 6 2 pV 2 ] 2 
= [(x - a) 2 



so that a + 6p 2 satisfies a quadratic equation [of lower degree than p(x)] 
with coefficients in F. Thus we can solve a quadratic equation for p 2 , 
obtaining p 2 in terms of rational operations and extractions of roots of 
elements in F. Another square root yields p. Of course we know all 
this in advance since it is trivial to solve for the roots of x 4 + 9 = 0. 
Let us note, however, that Gi in the above example is a maximal normal 
subgroup of G and that E is a maximal normal subgroup of G\. The 
factor groups G/G\ and G\/E are of order 2 (a prime number). The fact 
that 2 is a prime number and that a group whose order is prime is cyclic 
(see Theorem 8.20 for the importance of this fact) leads to the all-impor- 
tant theorem, which we state without proof. 

THEOREM 8.21. If G is the Galois group of an equation p(x) = rela- 
tive to its coefficient field F, a necessary and sufficient condition that 
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p(x) = be solvable by radicals relative to F is that the factors of com- 
position of G be entirely primes. 

It can be shown that in general a polynomial of degree greater than 4 
cannot be solved by radicals. The equation x b 1 = 0, however, can 
be solved by radicals. Factoring, we have 

(x - l)(.r 4 + a: 8 + x* + x + J) = 
Considering x 4 + x 3 + x 2 + x + 1 =0, let 



so that upon division by x 1 one has if- + y 1 = 0. One solves for y 
and then for x from x 2 yx + 1 = 0, 

Problems 

1. Solvo the quadratic x 2 -f- bx + c by the method of this section. 

2. Show that the automorphisms of a field form a group. 

3. Show that D(w) of (8 47) is given by D(w) = IT(w - w>), j > i 

4. Solve; for the roots of .r 6 1 . 

5. Find the polynomial with coefficients in the field of rationals having 

P = \/\/2 + 1 
as a zero. 
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CHAPTER 9 

PROBABILITY THEORY AND STATISTICS 



9.1. Introduction. It appears that the mathematical theory of prob- 
ability owes its formation to the inquisitiveiiess of a professional gambler. 
In the seventeenth century Antoine Gombaud, Chevalier de Mere, pro- 
posed some simple problems involving games of chance to the famous 
French philosopher, writer, and mathematician, Blaise Pascal. It is to 
mathematicians Pascal and Fcrmat that probability theory owes its 
origin, though since their early works a great number of mathematicians 
have contributed to its development. Its applications range from sta- 
tistics to quantum theory. An understanding of probability theory is 
necessary to undertake studies of the modern theory of games and infor- 
mation theory. It will be useful, however, to consider some elementary 
combinatorial analysis before we attempt a definition of probability. 

9.2. Permutations and Combinations. Let us assume that we have a 
collection of white balls numbered 1 to m and a collection of red balls 
numbered 1 to n. In how many ways can we choose exactly one red 
and one white ball? To each white ball we can associate n red balls. 
Since there are m white balls, it appears that there are m - n ways of 
choosing exactly one white and one red ball. In this example we note 
that the choice of a white ball does not affect the choice of a red ball, 
and conversely. The events are said to be independent. We state with- 
out formal proof the following theorem: 

THEOREM 9.1. If there are m ways of performing a first event and n 
w#ys of performing an independent second event, there are m n ways of 
performing both events. 

The use of Theorem 9.1 enables us to solve the following problem: 
Given a group of objects numbered 1 to n, in how many ways can we 
order these objects? We have n choices for the object which is to be 
placed first in our order. After a choice has been made there are n 1 
choices for the object that is to be placed second in the order. Con- 
tinuing, we see that there are n(n l)(n 2) 2-1 = n' ways of 
arranging or permuting the objects. If we wish to consider the number 
of arrangements or permutations of n objects taken r at a time, we arrive 
at the answer n(n 1) (n r + 1). We write 
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n P r = n P r = P n r = n(n - 1 ) (n ~ 2) ' (n - T + 1) 

n(n - 1) (n - r + l)(n - r) 



r)! 



- r)(n - r - 1) 1 

(9.1) 



Example 9.1. How many four-digit numbers can be formed from the integers 
1,2, . . . ,9, using any integer at most once? We have Pj == 9 f /5! = 3,024 as our 
answer. If we were to use the integers as many times as we pleased, our answer would 
be9 4 6,561. 

Example 9.2. Given 3 red flags, 4 white flags, and 6 black flags, how many signals 
can we send using the 13 flags? Let us assume for the moment that we can dis- 
tinguish between the red flags by numbering them 1, 2, 3, with the same statement 
concerning the white and black flags. It is then obvious that 13! different signals 
could be sent. Now the red flags can be permuted in 3' == 6 ways. Each permuta- 
tion, however, yields no new signal if the red flags are indistinguishable. We must 
divide 13! by 3! to account for this characteristic of the red flags. By considering the 
white and black flags we arrive at the correct answer, 13'/3!4!6!. 

The permutations of the integers 1, 2, 3, 4 taken two at a time are 
listed below: 

12 13 14 23 24 34 
21 31 41 32 42 43 

A particular arrangement of objects considered independent of their 
order or permutation is called a combination. We see that there are 6 
combinations of the integers 1 , 2, 3, 4 taken two at a time. Although 12 
and 21 are different permutations, they yield a single combination. If 
we wish to hire 3 secretaries from a group of 12, we are usually not 
interested in the order of hiring the secretaries. Now let us see in how 
many ways we can choose r objects from among n, the order of the choices 

being immaterial. Call this number n C r "C r = C? = f J. Every 
combination will yield r! permutations. Thus H C r r\ = H P r so that 

nl (9.2) 

v ' 



r\(n r)! 

Note that 1 = ( J = n!/n!0! so that 0! is chosen to be 1. What does 
n (?o signify? 

Example 9.3. In obtaining 5 cards from among 52 cards in draw poker we are not 
interested in the order in which the cards are dealt. Thus there are 52 6 combinations 
of cards which can be dealt. Let the reader show that 62 C 6 = 2,598,960. Let us 
determine how many full houses (three of a kind and a pair) can be dealt. If we con- 
sider the particular full house consisting of a pair of fives and three aces, we note that 
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there arc 4 Ct 4 ways of obtaining three aces. Thus there are 6 4 = 24 different 
combinations yielding three ares and a pair of fives. But there are 13 choices for the 
rank of the card yielding three of a kind, and then 12 choices remain for the choice of 
the rank of the pair. Thus there are 24 13 12 = 3,744 different full houses that can 
be dealt. 

Example 9.4. Let us consider a rectangular m X n grid. If we start at the lower 
left-hand corner and are allowed to move only to the right and up, how many different 
paths can we traverse to reach the upper right-hand corner? Ail told, we must move 
a total of m -f- n blocks. Once we choose any m blocks from among the m -f- n blocks 
for our horizontal motions, of necessity, the remaining n blocks will be vertical 
motions. Thus there are m4n ( t w = (m -f ri)\/m\n^ different paths. Note that the 
answer is symmetric in m and n. 

Example 9.5. Let us consider (x -f- y) n j n a positive integer. In the product 
(x -f- y)(x + y} ' ' ' (x + */) we note that terms of the type x r i/ H will occur. Since a 
term from each factor must be used exactly once, of necessity, r + s = n. The term 
x r y 8 can occur in exactly n C r ways since we can choose x from any r of the n factors. 



r=0 



Thus (x + y) n = 2^ ('*) x r y n - r . If we choose x = y = 1, we have 2 n = ^ f * V 

7 =0 

AJ 

From (1 + x) n = Y (") 
w V / 

<+->- lO")- 

fc = 

Equating coefficients of a-* yields 



<. + -(:)-. 


we have 




?< it 


2 A/ fr 




1 1 (DC)- 

6 = () r = 


_ y v (\ ( 

L, L, \>) V* 

A = / = 


-.) 



7- = 

Problems 

1. In how many ways can an ordered pair of dice be rolled? Ans. 3(> 

2. Which number will occur most often when a pair of dice is rolled? Ans. 7 

3. How many handshakes can 10 people perform two at a time? Ans. 45 

4. How many five-card poker hands can be dealt consisting of exactly one pair 
(other three cards different) ? A ns. 1,098,240 

5. Considering the grid of Example 9.4, show that the grid contains 

(m -f- l)(n -f l)mn 
4 

rectangles. 

6. Consider a function of p variables. How many nth distinct derivatives can be 
formed? Hint: The grid of Example 9.4 is useful if we designate the horizontal lines 
by xi, 2 , . . . , x p . Ans. n+ p~ l C p -i 

1. By differentiating (1 -f x) n show that (*) = (* ~ j) + ( H ~ 1 )' 

n 

8. Show that Y (~l) r l I =0. 



Y (- 
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9. By considering (1 -f x) n (l x) n = (1 x z ) n show that 
2k 



10. In how many ways can the integers 1, 2, 3, 4, 5 be ordered such that 110 mtegei 
corresponds to its order in the sequence? For example, 21453 is such an ordering, 
while 21354 is not, since 3 occupies the third position in the sequence. 

An,r' 1 X ' ' ! 

9.3. The Meaning and Postulates of Probability Theory. Whereas the 
psychologist, economist, political scientist, etc., can, with some measure 
of success, communicate their ideas to the layman, the mathematician, 
unfortunately, realizes that he has no hope of describing his chief math- 
ematical fields of interest with a reasonable prospect of being understood. 
A modicum of hope prevails when we consider probability theory from 
its most elementary viewpoint. It is rare indeed to find a man who has 
not evinced interest at one time or another in breaking the bank at 
Monte Carlo. However, one need only consider the vast number of 
persons who believe that someday they will discover an invincible sys- 
tem of gambling, to realize how little the layman actually understands 
the basic ideas of probability theory. Probability theory has been called 
the science of common sense. If this is the case, it is strange indeed that 
two persons using common sense can differ so greatly from each other in 
solving a problem involving probability theory. 

Let us now investigate the meaning of the word "probability." Web- 
ster's New International Dictionary states that probability is "the like- 
lihood of the occurrence of any particular form of an event, estimated as 
the ratio of the number of ways in which that form might occur to the 
whole number of ways in which the event might occur in any form." 1 
The same dictionary defines "likelihood" as "Probability; as, it will rain 
in all likelihood." 1 Let us consider the question, What is the probability 
that it will rain on Sept. 9, 1957, in Los Angeles? To some mathemati- 
cians this question is meaningless. There is only one Sept. 9, 1957. We 
cannot compute the ratio of the number of times it has rained on Sept. 9, 
1957, to the total number of days comprising Sept. 9, 1957, rain or shine, 
at least not before Sept. 9, 1957. After Sept. 9, 1957, the ratio would be 
zero or 1 depending on the weather. It is up to the meteorologist to give 
us a precise answer. Either it will or it will not rain on Sept. 9, 1957, in 
Los Angeles. If there is no front within 1,000 miles of Los Angeles on 
Sept. 8, 1957, the meteorologist would predict no rain with a high degree 

1 By permission. From "Webster's New International Dictionary," Second Edi- 
tion, copyright^ 1934, 1939, 1945, 1950, 1953, 1954, by G & C. Merriam Company. 
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of "probability." But the word " probability " would not be used as a 
mathematician defines probability. Let us now ask, What is the prob- 
ability that a five occur if a die be rolled? This question is, in a sense, 
very much like the question asked above. Theoretically, if we knew the 
initial position of the die and if we knew the stresses and strains of the 
die along with the external forces, we could predict exactly which of the 
six numbers of the die would occur face up. The same statement can 
be made about the weather. Unfortunately for the meteorologist there 
are too many variables involved in attempting to predict the exact state 
of the weather. The hope of the meteorologist is to reduce the number 
of relevant factors to a minimum in an attempt to predict the weather. 
The die problem differs from the weather problem in the following way: 
If we have the patience and time, we can continue to roll the die as often 
as we please. If after n throws we note that the number five has 
occurred r n times, we can form the ratio r n /n. One would be naive in 
calling r n /n the probability of rolling a five, even if n is large. There 
are some who would define the probability of rolling a five as 

p = hm - (9.3) 

n * oo 11 

The limit of a sequence cannot be found unless one knows every term of 
the sequence (the nth term of the sequence must be given for all n) . To 
compute (9.3), one would have to perform an infinite number of exper- 
iments, an obvious impossibility. 

Let us turn, for the moment, to the science of physics. Newton's 
second law of motion states that the force acting on a particle is propor- 
tional to the time rate of change of momentum of the particle. The 
particle of Newton's second law of motion is an idealized point mass. No 
such mass occurs in nature. This does not act as a deterrent to the 
physicist. The motion of a gyroscope is computed on the basis that ideal 
rigid bodies exist. The close correlation between experiment and theory 
gives the physicist confidence in his so-called laws of nature. The math- 
ematician working with probability theory encounters the same difficulty. 
He realizes that a die or a roulette wheel is not perfect. Man, however, 
has the ability to make abstractions. He visualizes a perfect die, and, 
furthermore, he postulates that if a die be rolled all the six numbers on 
the die are " equally likely" to occur. It is, of course, impossible to 
prove that the occurrence of each number of a die is equally likely. 
With these idealized assumptions and definitions the mathematician can 
predict the probability of winning at dice. The success of the gambling 
houses in Las Vegas is sufficient evidence to the professional gambler that 
the idealized science of probability theory is on firm ground. The pure 
mathematician is not interested in games of chance, per se. Probability 
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theory, to the pure mathematician, is simply a set of axioms and defini- 
tions from which he derives certain consequences or theorems. We now 
consider the axioms of probability theory. The reader is urged to read 
Sec. 10.7 concerning the union, intersection, complement, etc., of sets. 
We shall find it advantageous to consider a simple example before treat- 
ing the general case. 

If a pair of dice are rolled, the following events can happen as listed 
below: 



E: 



(1, 1) 


(2,1) 


(3, 1) 


(4, 1) 


(5, 1) 


(6, 1) 


(1,2) 


(2,2) 


(3,2) 


(4,2) 


(5,2) 


(6, 2) 


(1,3) 


(2,3) 


(3,3) 


(4,3) 


(5,3) 


(6,3) 


(1,4) 


(2,4) 


(3,4) 


(4,4) 


(5,4) 


(6,4) 


0,5) 


(2,5) 


(3,5) 


(4,5) 


(5,5) 


(6,5) 



(1,6) (2,6) (3,6) (4,6) (5,6) (6,6) 



E is the collection of all possible events. The elements of E are called 
elementary events. A subset of E, say, FI, is the set of events guarantee- 
ing the occurrence of the number 1 on at least, one of the two dice. 

/-V (1,1), (1,2), (1,3), (1,4), (1,5), 

(1,6), (2, 1), (3, 1), (4,1), (5, 1), (C>, 1) 

Another subset of E is the set F 2 consisting of the events (1, 2), (2, 1), 
and (6, 6). The union, or sum, of F\ and F is the set of all events or 
elements belonging to either FI or F 2 or both, written F\ U F 2 F\ + F 2 . 
In this example FI + F 2 consists of all the elements of FI plus the event 
(6, 6). The intersection, or product, of FI and F 2 , written FI P F 2 = FiF 2 , 
is the set of elements belonging to both FI and F 2 . In this example FiF 2 
is composed of the elements (1, 2) and (2, 1). If two sets FI and F 2 have 
no elements in common, we say that their intersection is the null set, 
written FiF 2 = 0. The complement of a set F is the set of elements of 
E not in F. What is the complement of the set FI defined above? The 
complement of the full set E is the null set 0. Now let G be the set of 
all subsets of E including E and the null set 0. The elements of G are 
called random events. G is said to be a field of sets in the sense that the 
sum, product, and complement are again elements of G. Let us now 
attach to each element of E (these elementary individual events are also 
elements of G) the nonnegative number -$. There is no mystery as to 
the choice of the number -^. First we note that E is composed of 36 
elements. The assumption that all 36 events are equally likely implies 
that any single event has a probability of -$ of occurring. If A is any 
subset of E consisting of r events, we define the probability of an ele- 
ment of A occurring as p(A) = r/36. Thus P(Fi) = , P(F 2 ) = ^, 
P(E) = 1. We define P(0) = & = 0. Thus to each set A of G we 
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have defined a real nonnegative number. If A and B have no element 
in common, then P(A + B) = P(A) + P(B). 

Formally, a field of probability is denned as follows : Let E be any col- 
lection of elements x, y, z, . . . , which are called elementary events, 
and let G be a collection of subsets of E. Assume that the following 
postulates or axioms are satisfied : 

I. G is a field of sets. 

II. G contains E and the null set. 

III. To each set A in G there corresponds a nonnegative real number, 
written P(A). The number P(A) is called the probability that an ele- 
ment of A will occur. 

IV. P(E) = 1, that is, one of the events of E is sure to happen. 

V. If A and B of G have no element in common, then 

P(A + B) = P(A) + P(B) 

The single toss of a coin is a simple example of a field of probability. 
E is composed of the elements H (for heads) and T (for tails). G con- 
tains the sets //, T, E = (H, T), 0. We define p(H) = J, p(T) = , 
p(H, T) = 1, p(0) = 0. p(E) = 1 implies that a head or tail will cer- 
tainly occur. 

Events, such as tossing a coin, rolling dice, spinning a roulette wheel, 
yield finite probability fields since there are only a finite number of dif- 
ferent events which can occur. If 5 cards are dealt from a pack of 52 
cards, there are exactly 52 6 different events which can occur. It is 
logical to postulate that the probability of any single event occurring 
from among the 52 C y & different events be given by p = 1/ 52 CV This is 
what is meant by an "honest" deal. 

It is important that the reader understand Postulate V. In rolling a 
die the event A = (1, 3) means that A occurs if a one or a three is face 
up on the die. The event B = (4) means that B occurs if a four is face 
up. Now A and B have no elements in common, that is, if A occurs, 
B cannot occur, and conversely. We say that A and B are mutually 
exclusive events. For a perfect die, p(l, 3) = f , p(4) = , so that Pos- 
tulate V states that p(A + B) = f -f i = 1- The probability that a 
one, three, or four occurs is |, the expected answer. A simple working 
rule for the reader is the following: 

RULE 1. If A and B are mutually exclusive events with probabilities 
p(A) and p(B), respectively, then the probability that either A or B 
occurs is 

p(A +B) = p(A) + p(B) 

The reader should give an example for which p(A + B) < p(A) + 
p(B). 
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Problems 

1. What is the probability of throwing a seven with two dice? Ans. ^ 

2. What is the probability of throwing a four, eight, or twelve with two dice? 

Ans. -J- 

3. Five cards are dealt from a deck of 52 cards. Find the probability of obtaining 
the following poker hands: three of a kind, a flush, a straight. 

88 33 10 - 4 6 



4. A set of balls are numbered from 1 to 20. Two balls are drawn simultaneously 
at random. What is the probability that their sum is 14? Hint: There are 20 C 2 = 190 
wrys of drawing two balls. Ans. $$ 

6. In Prob. 4 what is the probability that the sum of the two numbers is 14 if a ball 
is drawn and replaced and then a second ball is drawn? Ans. -j^ 

6. Five coins are tossed. What is the probability that exactly three heads occur? 
What is the probability that more than two heads occur? Ans. ^ 6 -, ^ 

7. If n coins are tossed, show that the probability that exactly r heads occur is 
C r /2". 

8. If a coin is tossed and a pair of dice are rolled, what is the probability that a head 
occurs and a total of 5 is rolled? Ans. ^ 

9. Show that Postulates IV and V imply p(0) = 

10. Let the complement of A be denoted by A so that A + A = E. Show that 
p(A) - 1 - p(A). 

11. Let A and B be elements of a field G. If p(A) 9* 0, define p A (B) by 



PA(B) is called the conditional probability of the event B under the condition A 
Show that 

p B (A)p(B) = 



Give a geometric interpretation of PA(B], assuming A and B are point sets in a plane 

9.4. Theorem of Bayes. Let us consider the following simple example : 

Assume that one throws a dart at a 
target whose area is taken to be 1. 
We assume further that the dart is 
sure to strike the target and that the 
dart is equally likely to strike any- 
where on the target. This means 
that if A is any area of the target then 
the probability that the dart strikes 
a point of A is simply the area of A, 
written p(A). Now let A and B be 
overlapping areas (see Fig. 9.1). 

Let us consider a second person 
located in another room who knows 
that a dart will be thrown at the target. The probability that the dart 
falls in A will be given by p(A). Let us now assume that after the dart is 




FIG. 9.1 
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thrown our second person asks the following question : Has the dart fallen 
in the area B? He receives the truthful answer, "yes." How does this 
affect this person 's opinion as to the probability that the dart has also 
fallen in A ? We can answer this question in the following manner : It is 
fairly obvious that if it is known that the dart lies in B then the only way 
it can also lie in A is for the dart to have struck the region A C\ J3, and 
the probability of this happening before it is known that the dart has 
fallen in B is simply the area of A H B, written p(A C\ B). Since the 
dart is known to be somewhere in B, it appears that the ratio of the area 
of A C\ B to the area of B will yield the probability that the dart also 
lies in A C\ B. For example, if A C\ B is ^ the area of B, then the chance 
that the dart is in A C\ B if it is known that it is in B is simply ^. We 
write 

- P(B) * (9.4) 

and define pn(A) as the probability that the event A has happened if it 
is known that the event B has happened. We may also say that ps(A) is 
the conditional probability of the event A under the condition B. 

It is to be noted that (9.4) is a definition and requires no proof. Experi- 
mentally, however, one might attempt to verify (9.4) to some extent. 
Every time an observer learns that the dart is in JB, he records whether 
the dart is in A or is not in A. He is not concerned with those experi- 
ments for which the dart lies outside of B. The ratio of the number of 
successes to the total number of tries (the dart must be in B} yields a 
fraction lying between zero and 1. For a large number of experiments 
it is reasonable to hope that this number is close to the number p B (A). 

Since the roles of A and B can be interchanged, we have from (9.4) 

} P(A) * (9.5) 

Combining (9.4) and (9.5) yields one form of Bayes's theorem, 



since p(B C\ A) = p(A C\ B). p(A) is called the a priori probability 
that A occurs, whereas ps(A) is called the a posteriori probability that 
A occurs under the hypothesis that B occurs. 

Example 9.6. A group of 100 girls contains 30 blondes and 70 brunettes. Twenty- 
five of the blondes are blue-eyed, and the rest are brown-eyed, whereas 55 of the 
brunettes are brown-eyed, and the rest are blue-eyed. A girl is picked at random. 
The a priori probability that she is a blonde and blue-eyed is -j^nr = -J. If we pick a 
girl at random and find out that she is blue-eyed, what is the probability that she is 



346 ELEMENTS OF PUKE AND APPLIED MATHEMATICS 

blonde? Let x denote the quality of being blonde and y denote the quality of being 
blue-eyed. We are interested in determining p y (x). Now p(y) = 3^^ == 0.4, since 
40 of the 100 girls are blue-eyed. Moreover, p(x C\ y) = -J-. Applying (9.4) yields 



P(y) 

Note that there are 25 blue-eyed blondes and 15 blue-eyed brunettes, so that the 
appearance of a blue-eyed girl means that the probability of the girl being a blonde is 
= . In information theory one determines the ratio of the conditional proba- 
bility to the a priori probability and defines the amount of "bits" of information as 

the logarithm (base 2) of this ratio In this example, / = Iog2 3 = Iog2 -pf 1. 

We can extend the result of (9.5) as follows: Let E be a collection of 
elementary probability events Ai, A*, . . . , A n with the A l , i = 1, 
2, . . . , n, mutually exclusive, so that 

E = A l + A,+ - - + An = VJ A, 
Now let B be any subset of E. Then 

Let the reader deduce that B C\ A % , i = 1, 2, . . . , n, are mutually 
exclusive. Thus 

p(B) = p(B C\ .40 + p(B C\ A.) + + p(B C\ A n ) 

p(B H A.) (9.8) 



Applying (9.5) yields 

p(B) = p(A t )p Al (B) (9.9) 



Substituting (9.9) into (9. 6) "yields Bayes's formula, 



P(A,) = ~~ 



J p(A t )p At (B) 



It is important to realize that the event B need not be a subset of E 
in the ordinary sense. For example, the events A x , A 2 , . . . , A n might 
be different urns containing various assortments of red and white balls. 
The event B could be the successive drawing of three red balls from any 
particular urn, each ball being returned to its urn before the next draw- 
ing. Formulas (9.7) to (9.10) would still hold. By B n A l = A l H B 
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we mean the composite event of choosing AI and then drawing three 
successive red balls as described above. 

Example 9.7. Let us consider two urns. Urn I contains three red and five white 
balls, and urn II contains two red and five white balls. An urn is chosen at random 
(p(Ij) = -1 ? p(I 2 ) = ^), and a ball chosen at random from this urn turns out to be red 
(pi(K) - |, pn(R) = y). What is the a posteriori probability that the red ball came 
from urn I? Applying (9.10) yields 

1 3 91 

_ .__a_j[ ^ ff 

Example 9.8. Two balls are placed in an urn as follows: A coin is tossed twice, and 
a white ball is placed in the urn if a head occurs, with a red ball placed in the urn if a 
tail occurs. Let Ao, AI, A 2 represent the events of the urn containing none, one, and 
two red balls, respectively. Balls are drawn from the urn three times in succession 
(always returned before the next drawing), and it is found that on all three occasions 
a red ball was drawn. What is the probability that both balls in the urn are red? 
Let B be the event of drawing three successive red balls. We are interested in com- 
puting PB(AZ). Applying (9.10) yields 



p(A )p Ao (B) + p(Ai)p Al (B) + p(A<t}pAi(B) 

Fromp(Ao) = ?,p(Ai) = , p(A 2 ) = T, p A9 (B) = 0,p Al (B) = (^) 3 = ?,p At (B) = 1, 
we obtain pB(A 2 ) =3-. Let the reader show that, if B n represents the successive 
drawing of n red balls, then pB n (A>i) tends to 1 as n * . Is this reasonable to expect? 
We note that p Al (B) represents the probability of drawing three successive red balls 
from an urn containing one red and one white ball. We liken this problem to that of 
finding the probability of tossing three successive heads. The event space for this 
case is (H, H, //), (H, H, T), (H, T, H), (//, T, T), (T, H, H), (T, H, T), (T, T, H), 
(T, T, T), involving eight equally likely events. The a priori probability of the event 
(H, H, H} is |-. We note that, for a single toss of the com, p(H) -%. Is it reasonable 
to expect that p(H, H, H) = p(H}p(H}p(PT)t This question is answered in the next 
section. 

Problems 

1. Three urns contain, respectively, 2 red arid 3 black balls, 1 red and 4 black balls, 
3 red and 1 black ball. An urn is chosen at random and a ball drawn from it. If the 
ball is red, what is the probability that it came from the first or second urn? Ans. ^ 

% f Urn I contains two red and three black balls. Urn II contains three red and two 
black balls. A ball is chosen at random from urn I and placed in urn II. A ball is 
then chosen at random from urn II. If this ball is red, what is the probability that a 
red ball was transferred from urn I to urn II? Ans. y\ 

9.5. Independent Events. Events Not Mutually Exclusive. If we 

rewrite formula (9.5), we have 

p(A O B) = p(A)p A (E) (9.11) 

Formula (9.11) states that the probability that both A and B happen is 
the probability that A happens times the probability that B will happen 
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if A happens. What can we surmise if p(A r\ B) = p(A)p(B)t Of 
necessity, PA(B) = p(B), so that the a priori probability of B happening, 
p(B), does not depend on the event A. We say that A and B are inde- 
pendent events. Two events are independent, by definition, if 

p(Ar\B) = p(A)p(B) (9.12) 

A set of probability events, AI, A 2 , . . . , A Tl) are said to be mutually 
independent if 

p(A t C\ A,) = p(AJp(A,) i * j (9.13) 



for i, j = 1, 2, . . . , n. 

If we consider, for example, the tossing of a coin and the rolling of a 
die, we obtain the event space of 12 elements, 

(H,l) (H,2) (#,3) (ff,4) (H, 5) (H, 6) 

(T, 1) (T, 2) (T, 3) (7', 4) (T, 5) (T, 6) 

We can assign the a priori probability of ^ to each event. The prob- 
ability that a head occurs and a six occurs is taken to be ^. If we look 
upon the rolling of a die as completely independent of the tossing of a 
coin, we note that p(H) = , p(6) = , and p(77 Pi 6) = = ^ 
Assigning the a priori probability of ^ is equivalent to assuming that 
the two events are independent. 

Example 9.9. A die is rolled n times. We compute the probability that a six occur 
at least once. The a priori probability of failing to throw a six on any given toss of the 
die is -^. If we assume that each successive roll of the die is independent of the 
previous throws, the probability of not obtaining a six in n throws is () Tt . Hence the 
probability of rolling at least 1 six m n tosses is 1 (-)" For n 3, p < -$, while for 
n = 4, p > . 

Example 9.10. The game of craps is played as follows: A pair of dice is rolled, and 
the player wins immediately if a seven or eleven occurs. He loses if a two, three, or 
twelve arises on the first throw. If a four, five, six, eight, nine, or ten is thrown, the 
player continues to roll the dice until he duplicates his first toss or until a seven occurs. 
The numbers two, three, eleven, twelve are disregarded after the first toss. If the 
player duplicates his first toss before rolling a seven, he wins; otherwise he loses. We 
compute the probability that the man rolling the dice will win. At gambling estab- 
lishments an opponent of the roller of the dice (except the establishment) does not win 
if the player rolls a twelve. The probability of winning on the first toss is 

p(7) +p(ll) = A 

since there are 6 ways of rolling a seven, 2 ways of rolling an eleven, and 36 total 
possible throws of two dice. The player can also win by rolling a four and then rolling 
a four before a seven. The probability of rolling a four is ^, and the probability of 
rolling a four before a seven is . Thus the probability of rolling a four and then 
rolling another four before a seven is -/g - % = ^g. The same reasoning for the num- 
bers five, six, eight, nine, ten yields 

- 49292 
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Example 9.11. Bernoulli Trials. Let p be the probability of success of a certain 
experiment, q = 1 p the probability of its failure. Assume that the experiment is 
performed n times, the probability of success remaining constant, while the result of 
each experiment is independent of all the others. Such a sequence is called a Ber- 
noulli sequence. The simplest example is illustrated by the repeated toss of a coin. 
We determine now the probability of attaining exactly r successes in n trials. In the 
case of the coin problem a particular successful sequence would be the sequence 
H\Hi - ' H T T\L\ - T n -r if we were interested in obtaining exactly r heads. 
There are obviously n C r different sequences containing exactly r heads and n r tails. 
The probability of obtaining any particular sequence is (-g-) n . Thus the probability 
of obtaining exactly r heads in n tosses of a coin is given by n C r /2 n . Let the reader 
show that 

P r = n r p r q n - r (9.14) 

is the probability of obtaining exactly r successes in n trials for the general Bernoulli 
sequence. It can be shown that the number r which makes P T a maximum for a 
given n and p is the greatest integer less 
than or equal to (n -\- l}p. 

We now consider events which 
may not be mutually exclusive. Let 
E be an event space with subsets 
AI, A 2 , . . . , A n . The A t , i = l f 

2, . . . , n, need not be mutually 
exclusive (see Fig. 9.2). 
Certainly 



FIG. 9.2 




since there may be overlapping regions of A iy A 2 , A 3 
is easy to verify that 



Geometrically, it 



Ai\J A 2 VJ A* = AI + A 2 + A 3 - AiC\ A* - A 2 H A 3 

- A 3 n A I + A I C\ A 2 r\ A 3 (9.15) 

In AI + A 2 + A 3 we have counted AI C\ A 2 twice; so we substract 
Ai n A 2 , with a similar statement for AI Pi A 3 , A% C\ A z . In AI + A 2 
+ A s we have counted AiC\ A^C\ A 3 three times, and then we have 
subtracted AI H A 2 C\ A$ three times in A 1 C\ A%, etc. Thus we add 
AI C\ Az n A 3 to obtain (9.15). In general 



* <ssl 



(9.16) 
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Example 9.12. Balls numbered 1 to n are placed at random in n urns numbered 
1 to n, one ball to each urn. What is the probability that the number of at least one 
ball corresponds to the number of the urn in which it is placed? Let A t be the event 
of having the ith ball in the ith urn regardless of the distribution of the rest of the balls. 
Then p(A t ) = 1/n for i 1, 2, . . . , n. p(A t C\ Aj} represents the probability 
that both the ith and jth ball be in their proper urns. From (9.5), 

p(A t n A,) - p(A,)pA,(A.) 

Now pA.(At) = l/(n 1), since, if it is known that the jth ball is in the proper 
urn, there is one chance in n 1 that the ?th ball will bo in its proper urn Thus 
p(Ai C\ Aj) l/n(n 1). We note also that there arc "C 2 different A* C\ A } . 
Continuing, it is easy to see that 



P - n - _ + ____ ____ 4- 

W \2/ n(n - 1) W n(n - l)(n - 2) ^ 

i __ L_j__l __ i _i- . . . 4. (__i)n+i _i 
1 21+3! 41^ ^ ( L) n! 

Let the reader show that lim P n = (e l)/e. 

n * 

We conclude this section by considering the simple one-dimensional 
random-walk problem. Assume that a person starts at the origin. A 
coin is tossed, with the result that if a head occurs the person will move 
one unit to the right and if a tail occurs the person will move one unit to the 
left. The process is repeated n times. What is the probability that the 
final position is located at x = r? Let u be the number of moves to 
the right and v the number of moves to the left that can occur so that 
the final position will be at x = r. Then u v = r, u + v = n, so that 
u = (r + ri)/2, v (n r)/2. There are n C u different ways in which 
one can move to the right, the rest of the moves being to the left. The 
probability of ending at x r is thus n C y ( r+n )/2/2 w . This problem is 
similar to that of Example 9.4. 

Problems 

1. What is the probability that -a seven occurs at least once in n tosses of a pair of 
dice? Ans. 1 - (|) n 

2. Derive (9.14). 

3. Show that 

P[AI r\ (A 2 u A 3 )i = P<A! n AO + P<AI n A 8 ) - P(A, n A 2 r\ A 3 ) 



What does this formula reduce to if Ai, A 2 , and As are mutually independent? 
4. Show that the probability of having at least r successes in a sequence of n Ber- 



n 

noulli trials is P = } f - j p t q n ~ i . 



5. Show that > ( i ) p f (l - p)~ - 1. 
t-0 
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6. A com is tossed in a Bernoulli sequence. What is the probability of obtaining 
m heads before n tails appear? Hint: If there are at least m heads in the first m + 
n 1 tosses of the coin, the game is won. 

m-\-n I 

1 V /m+n-l\ 
^ ( r ) 

r<*m 

7. Generalize the random-walk problem to the case of two dimensions with p -| 
for each of the possible motions. 

8. A and B have, respectively, n + 1 and n coins. If they toss their coins simul- 
taneously, what is the probability that A will have more heads than B? Ans. p = -% 

9. What is the probability of getting exactly 2 sixes in three throws of a die? 



10. Coin 1 has a probability of p of getting a head, and coin 2 has a probability of q 
of getting a tail. We start with coin 1 and keep tossing it until a tail occurs, where- 
upon we switch to coin 2. Whenever a tail occurs on coin 2, we switch to coin 1, etc. 
What is the probability that the nth toss will be performed on com 1? Hint: Let P n 
be the desired probability so that P n \ = P n p -f (1 P n )</, P\ = 1. Show that 

PM-I - Pn = (p - q)(Pn - P-i) = (P - D(p - q) n+1 

Ans P H - gJlJ 



1 +q - p 

9.6. Continuous Probability and Distribution Functions. In previous 
discussions the probability event space consisted of a finite number of 
elementary events. A simple example illustrates the extension of this 
idea. We consider an idealized spinner whose pointer can assume any 
one of the directions ^ x < 2ir, x measured in radians. We say that 
there is zero probability that the direction of the pointer be less than zero 
or greater than 2ir since we are concerned only with angles between zero 
and 2w radians. The direction of the pointer has been put into a corre- 
spondence with the real-number system. The set of all possible direc- 
tions is called a one-dimensional random, or stochastic, variable, usually 
denoted by . In this example it makes very little sense to ask the 
following question: What is the probability that a random spin yields 
the direction ? #o is just one possible event of an infinite number of 
different events which may occur. It does make sense to ask the follow- 
ing question : What is the probability that the random event be less than 
or equal to x? We define the a priori distribution function F(x) as 

F(x) = for x < 

F(x) = P( g x} = - for ^ x < 2* (9.17) 

JjTT 

F(x) = 1 for x ^ 2?r 

Figure 9.3 shows a graphic representation of F(x). 
For any x and y we note that (9.17) yields 

P(xSy) = F(y) - F(x] 
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For y = x + Ax we have P(x ^ ^ x + Ax) = F(x + Ax) - F(x) = AF. 
AF is the probability that lies between x and x + Ax. We also note 
that F(x) is a monotonic nondecreasing function of x with F( <) =0, 
F(+oo) = 1. 




FIG 9.3 



For the more general case we consider a nonnegative function p(x) 
such that p(x) dx represents the probability that the random event lie 
between x and x + dx, except for infinitesimals of higher order. We 
further desire 



p(x] dx = 



(9.18) 



p(x) is called the probability function of the random variable . The 
distribution function is simply 



(9.19) 



F(x) = P( ^ x) = /"' 7>(x) dx 

J-OO 

Note that F(x) is nondecreasing since p(x) is nonnegative, with 

F(-oo) = F(oo) = 1 

dF(x) = p(x) dx is the probability that lies between x and x + dx. 
A simple example of a probability function is p(x) = - y r- > with 

7T 1 j~ X 



The nth moment of a distribution is defined by 

a n = ("^ x n p(x) dx n = 0, 1, 2, ... (9.20) 

provided the integrals exist. For p(x) = - ^ j- ^ only a exists. From 

7T A ~~f~ X 

dF(x) = p(x) dx we can write 

n / r n r!F(r} n 1 9 fQ91"! 

^n / k t *'-' v**'/ '^ "> *- j ^j ^J/.^i^l 

Equation (9.21) is the Stieltjes integral of Chap. 7. The discrete case 
can be handled by use of this integral. Let x = 1 represent the occur- 
rence of a head and x = 2 the occurrence of a tail when a coin is tossed 
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with p(l) = , p(2) = , p(x) = otherwise. Then F(x) = for x < 1, 
^0*0 = I f or 1 ^ # < 2, /^(x) = 1 for x ^ 2. We can consider that two 
point masses contribute to the probability function. From dF(x) 
for x < 1, dF = 1 at x = 1, dF = for 1 < x < 2, dF = J at a; = 2, 
dF = f or x > 2, we note that 





dF = f 

= 1 



for x < 1 
for x = 1 
for 1 < x < 2 
for x > 2 



Geometrically (see Fig. 9.4), we note that F(x) is monotonic iiondecreas- 
ing. Discontinuities in F(x) occur at # = 1, x = 2, because point masses 

F(x) 



F(x)=Q 



1 2 

Fi(i 9 4 
are situated there. To compute i, we note that 



a. = ; = = 

Had we chosen x\ = 1 , x-i = 1 , we would have obtained 

01 = (-l)i + 0)i =0 
Note that |- is the mean of x = 1 and x = 2. 

Example 9.13. TTi? Gaussian Distribution. Let us assume that a dart is thrown at 
the #?/ plane under the following assumptions: 

I. If p(x, y) dy dx is the probability that the dart fall in the area bounded by x and 
x -h dx, y and y -f- rfy, then p(x, y) depends only on the distance of (x, y) from the 
origin and is independent of = tan" 1 (y/x). Thus 



p(x, y} dy dx = q(r*)r dr dB q(x 2 
p(x, y) = q(x 2 -\- i/ 2 ) 

We assume further that p(x, y) is differeritiable. 

II. p(x, y) * as x or y becomes infinite. 

III. 1^ r p(x,y)dydx = 1. 



dx 



We attempt to determine p(x, y). First we note that 
/_ oo p ( x> ^ ^ ^ /- - P ^ X) ^ dx ' dy 
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since PI(X x + dx, < y < oo) - / p(x, y) dy dx, 

P 2 (- oo < x < oo, y ^ <rj ^ y -f dy) = / ^ p(x, y) dx dy 
and P(x ^ ^ x + dx, y ^ r, ^ y + dy) = P,P Z = p(z, y) dy dx 

/oo r oo 

p(z, ?/) dt/, S(//) = / p(x, y) dx, so that 



?Gr 2 + ?/ 2 ) (9.22) 

Differentiating (9.22) yields 



and . , constailt = fl (9.23) 

yS(j/) dy o;/2(x) dx 

From (9.23), JB(a;) = Ae f , S(y) = Be a ^ 2 , and 

p(x, y) = g(x 2 + ?/ 2 ) = (V<' i+ * f > (9.24) 

Condition II implies that a is negative If we choose a = l/2<r 2 , of necessity, 
C 1/2* from condition III. Thus 

P(X, y) = o^-, f-d^DCjrt-l-yl) ( 9 25) 

^Trer^ 
Equation (9.25) is the normal distribution of Gauss For the one-dimensional case 





0- X/27T 

where m represents a displacement from the origin Let the reader show that 

(a; - 



Thus <r 2 is the second moment of <p(x) relative to the center m. 

Example 9.14. Tchebyscheff's Theorem. Let be a random variable with a proba- 
bility function p(x). Let g() be a nonnegative function of 4, and let S be the set of 
points such that g() ^ K > 0. 5 will denote the rest of the real axis. The expected 
or mean value of g() is denned as 

E[g(&] - [ " g(x)p(x) dx 

J 00 

The probability that </() be greater than or equal to K when is chosen at random is 

simply the probability that be found in S, so that P[g() ^ K] == / p(x) dx. Now 

7*s 

- [ O g(x)p(x)dx+ Lg(x)p(x)dx 

JS JS 

g(x)p(x) dx^K p(x) dx 
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since g(x) and p(x) are nonnegative. We obtain Tchebyscheff's theorem, 



P\a(& 6 K] 5 (9.27) 

What difficulties arise if g(x) = 1 for j- irrational, </(x) = for x rational, K = ^? 
If m is the first moment or mean of and a 2 is the variance of defined by 



(x m) 2 p(x) dx 
> 

let the reader deduce the Bienayme-TehebyschefT inequality, 

P(\t ~ w ^ ka) ^ ^ (9.28) 

if we choose #() = ( ?/i) 2 > A' = A'V 2 . 

Example 9.15. 7 7 /?f Buff on, Needle Problem. A board is ruled by equidistant 
parallel lines, two consecutive lines being d units apart. A needle of length a < d is 
thrown at random on the board What is the probability that the needle will intersect 
one of the lines? Let x and y describe the position of the needle (see Fig. 9.5). 






r.JfLj?. 



FKI 9.5 

If y ^ a sin x, our situation is favorable, while if d > i/ > a sin x, an unfavorable 
case occurs We assume that x and y are independent random variables Thus 

Pi(x ^ ^ x + dx) = pz(y ^ rj ^ y -f dy) = '-f 

7T (/ 

and p(x, y) dy dx = (l/ird) dy dx. We are interested in F(0 ^ x ^ ?r, ^ ?/ ^ 
a sm x), which is defined by 



C TT fa sm x } C v r a sm x 

Jo Jo p(x > y)dydx =^d]o Jo d 



2a 



" Example 9.16. Let be a random variable with a probability function p(x). The 
probability that lies between x\ and x\ -\- dx\ as the result of a single experiment is 
P(XI) dx\. If we repeat the experiment, the probability that lies between x 2 and 
x 2 + dx 2 is p(x z ) dxi. The joint probability that both results happen as the result of 
two experiments (assumed independent) is 

pCrOpOrz) dxi dx 2 (9.29) 

Equation (9.29) is a special case of the single random variable (, rj) with probability 
function p(x, y). We note that 



/oo r * 
/ p(xi)p(x z ) dxi dx 2 = 1 
-00 7-00 
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We now ask the following question : What is the probability that, as the result of two 
experiments, x\ + xz ^ K, K a constant? In Fig. 9.6 we note that this is just the 







2 =A- X l 
FIG. 9 6 

probability that the point (x\ y #2) lie above the line x\ -\~ Xi K, so that 

P(XI -f x z ^ K) = / / p(zi)p(:r 2 ) dx-i dxi 

7- JK-xi 

From (9.30) we have 

Pfa+xt^K+dK) = f [" 

y oo ^A+d/v xi 



so that 



X! + x* K + dK) 



oo fK+dK-xi 



f oo fK+ 

= / / 

J - oo JK-JTi 



(9.30) 

(9.31) 

i dx^ (9 32) 



[ K+dK-xi 

by subtracting (9.31) from (9.30) Since / pCr 2 ) da: 2 ~ p(K x\) dK, 

J K xi 
note that the probability function for xi -f Xz = u is 



1. Show that < 



p(w) = /_ 

Problems 

i of (9.26) satisfies [ ^(x) 



(9.33) 



1. 



2. Derive (9.28). 

3. Find the four-dimensional joint probability function for the two-needle Buffon 
problem. 

4. Show that <r 2 & / (z - ra) 2 p(z) dx - a 2 - m 2 , a 2 denned by (9.20). 

./-co 

5. Let and 17 be random variables with Gaussian probability functions (m = 0). 
Show that u ~ + 17 has a Gaussian distribution. 

6. Find en, a 2 , a 3 for ^>(x) of (9.26). See (9.20) for the definition of a t , i = 1, 2, 3, . . . 

7. If is a random variable with a Gaussian distribution (m =0), find the proba- 
bility function for the random variable 2 . Hint: Let u = 2 so that 



P(u - 







^ 



P(\t\ 



-f 
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Then show that P(t ^ u ^ t + dt) = (l/<r \/2irO e~</ 2<r2 eft, so that 
1 
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for j > 



p(Q = for t g 



8. Let be a random variable with probability function p(x). Let u = w() be a 
strictly increasing (or decreasing) function of . Show that the probability function 



for the random variable u is q (u) = p(x(u)) 



du 



where x(u} is the inverse function 



of u(x). Apply this result to Prob. 7. 

9. Consider a distribution of points in the plane with a density function given by 
(9.25). At t = each point moves in a direction <p with speed V and with a proba- 
bility function p(<p) = <p/2w, ^ <f> < 2?r. Show that at time t the probability func- 
tion in space and direction of motion is given by 



1 



(,-( 1/2(T2) [ 



~ 



Show that p(r, 6, <?, t} satisfies ~ + V (pv) = witli 

v = V cos (p i + T 7 sin <p j = V cos (<p 0)e r -f- ^ sm (^> 0)e0 

9.7. The Characteristic Function. Bernoulli's Theorem. The Cen- 
tral-limit Theorem. Let J be a random variable with a probability 
function p(x). Let us assume that the Fourier integral of p(x) exists. 
Then 



c**p(x) dx (9.34) 

is called the characteristic function of p(x), with t real. We note that 
<p(0) = / p(x) dx = 1. For t real, n a positive integer, we have 

7 00 

I f x zn e* x p(x) dx ^ f x' 2n p(x) dx = a 2n 
I y-oo y -oo 

If we can differentiate inside the integral, then 



= n 



x n p(x) dx = i n a n 



(9.35) 



Hence, if we can find <p(t), it will be possible to find the nth moment 
from (9.35). 

Example 9.17. Let p(x) = e~ x for x ^ 0, p(x) =0 otherwise. Then 

-n T -^ 



= /.' 



"dt 
d*<f> 
dt* 



= -2 

t-o (1 - i 
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so that ai 1, a 2 - 2. Since 



00 

__J_ = V 



n=0 

we have ^> (n) (0) = i n n! and a n = n'. This result is also obvious since 



r(n + 1) = x n e'*dx = n' 

yo 

Example 9.18. Bernoulli's Theorem. For a Bernoulli sequence of n events we have 

P r = "p r a ~ P) n - r (9.36) 



for the probability of obtaining exactly r successes [see (9.14)] 
The analogue of (9.34) for the discrete case is 



2 (>inp > 



c*dF(x) (9.37) 

-oo 

Applying (9 36) yields 



= [pc 4 -f (1 - P)] n 

Thus >(0) = 1, ^>'(0) = ipn, <^"(0) = n(n - I)/) 2 - np Hence ai = m = p 
a 2 = n(n l)p 2 -f np, so that er 2 = o 2 w- 2 = wp(l - p) (see Prob 4, Sec. 9.6). 



Applying (9.28) with k = e \-7j-y > 0, yields 

P(l - np| ^ en) ^ ^ ^ ^ ^ n (9.38) 

since p(l p) ^ -J- for ^ p ^ 1. Formula (9.38) is equivalent to 

(9 - 39) 



Kormula (9.39) is a form of Bernoulli's theorem In other words, the probability that 
the frequence of occurrence, /n, differs from its mean value p by a quantity of 
absolute value at least equal to e tends to zero as n > however small e is chosen. 
This essentially means that the greater n is the more certain we are that /n differs 
little from p, where is the total number of successes in n trials. 

Let us return to the characteristic function. From (9.34) we note that, 
if <p(t) is known, then 

i r 

t (9.40) 
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Hence p(x) is uniquely determined if the characteristic function is known. 
Equation (9.40) follows from the Fourier integral theorem. If and 17 
are independent stochastic variables with probability functions pi(x), 
p*(y), respectively, it follows from the method of Example 9.16 that 

p(u) = I pi(x)pi(u x) dx (9.41) 

J co 

is the probability function for the stochastic variable + y- The char- 
acteristic function of p(u) is 



<p(t) = c ltu p(u) du 

J 00 

- f f x e^p 1 (x)p 2 (u - x) dx du (9.42) 

J 00 J <X> 

Now <pi() = ( e**pi(x)dx 





so that ?i(<WO = f f 00 c^ r+ ^p l (x)p^(y) dy dx 

J _ oo 7-00 

= f 00 f e ttu pi(x)p z (u - x) du dx (9.43) 

J oo ^ 00 

by letting ?/ = # + ?/. A comparison of (9.42) with (9.43) yields 

(9.44) 



provided the order of integration can be interchanged. 

Another interesting result is the following: If <pi(t), <pz(t), . . . , 
<p n (f), ... is a sequence of characteristic functions which converges to 
<p(t), obtained from a sequence of probability functions pi(x), p*(x), . . . , 
p n (x), , it may be possible that 

r-*v(t) dt (9.45) 

ao 

where <p(t) and p(r) are limits of their respective sequences. In order 
that (9.45) hold, we must have 

1 T 00 1 f" 

p(x) = lim p n (x) = Hm / e~*<p n (i) dt = - / e-*v(0 * 

7} oo n oo ' 7r J <*> **R J oo 

(9.45') 

The reader is referred to Sec. 10.22 for a discussion of this type of problem. 
Let us consider now the following example, which will illustrate one 
aspect of the central-limit theorem involving probabilities. Let be a 
random variable, and, to be momentarily specific, we shall consider the 
toss of a coin. The random variable will have the value x = 1 if a 
head occurs and the value x = 1 if a tail occurs. If we toss the coin 
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five times, we can record the numbers (xi, x^ #3, x^, x$) with Xi equal to 
1 or 1, i = 1, 2, 3, 4, 5. The set (#1, #2, #3, #4, x$) represents a point 
in a five-dimensional space. We can repeat the experiment of tossing 
five coins as often as we please and obtain a set of 5-tuples, (x\, x\, x\, 
x\, xi), i = 1, 2, . . . . We may look upon the aggregate x\, x\, xf, . . . 
as a set of results yielding information about the random variable . The 
same statement can be made concerning the set x\, x\, x\, . . . , x\, . . . , 
etc. The set of 5-tuples can also be looked upon as defining a new random 
variable. For the more general case we let p(x) be the probability func- 
tion for the random variable , and we consider the n-tuple of independent 
events (x\, x^ . . . , x n ). The probability that a point lie in the volume 
bounded by x\ and Xi + dxi, 0*2 and Xi + dxt, x n and x n + dx n is given by 

p(xi)p(x 2 ) ' ' ' p(x n ) dxi dx z ' ' ' dx n 

n 

If 8 n is the random variable N x t , an extension of Example 9.16 shows 

ii 
that the probability function for >Sf w is given by 

/> ( - I x.) 

l-l 

n-1 

= f~^ ' f~^ p(jci)p(x$ ' ' ' p (u - ^ x^ dxn-i ' ' ' dx l (9.46) 

1-1 

The characteristic function is 

^(hi 

C+** -**-> f p(x,) P(x n )dx 1 - dx n (9.47) 



by letting w = #1 + #2 + ' + x n , du = dx n , and reversing the order 
of integration. Equation (9.47) becomes 



(9.48) 
with <p(t) = / p(x)e itx dx. Let us assume that the first moment of 

y w 

p(x) is zero so that o- 2 is its second moment. Thus ^(0) = 1, ^'(0) = ^7 
^ x/ (0) = <r 2 . If <p(f) has a continuous third derivative in a neighbor- 
hood of t == 0, then 



with \a(t)\ < A^ 3 (see Sec. 10.23). 
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If T n is the random variable S n /<r \/X l e t the reader show that the 
characteristic function for T n is \l/ n (t) = $ n (<A \/n). Hence 



with \a(t/a Vn)\ < At*/<r*n*. From In (1 + z) = z + /3(z) with \p(z)\ < 
Bz z for z small, one has that 

[2 / f \~| 2 

1 - o- + ( "-7= ) = - o (9.49) 
^^ \<7 V^/ J ^ 

From (9 49) it follows that 

lim $ n (t) = e~* 2/2 

The probability function P(x) associated with the characteristic function 

e~ tz/z is 

P ^ = i T e ~'^" X di = 77^z ( '^ /2 (9-50) 



If we accept (9.45'), we have shown that if 1, 2, , n, are 
random variables with the same probability function p(x), with a mean 

n 

equal to zero and second moment equal to o- 2 , then T n = Y f/ 

t~i 
is a random variable whose probability function approaches the normal 

distribution (l/v / ^") e ~ a " 2/2 as n becomes infinite. We say that the 
sequence of random variables obeys the central-limit law. 

Problems 

1. Derive (9.49). 

2. Derive (9.50). 

3. Show that the characteristic function for Cauchy's distribution function, 



4. Let p(x) = 1 for J- ^ x ^ ^, p(x} =0 otherwise. Show that 

<p(t) = - sin g 

6. If is a random variable with characteristic function p(t) t show that e~* ta <f>(t) is 
the characteristic function for the random variable a, a = constant. 

6. Consider a sequence of 1,000 Bernoulli trials with p = ^. Show that the proba- 
bility that the experimental ratio r/n will differ from % by less than 0.01 is greater than 
or equal to f- . 
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9.8. The x 2 Distribution. Application to Statistics. From the defini- 
tion of the gamma function (see Sec. 4.16) we have 



= / e 
Jo 

/ 

a * I 

Jo 



/" & 

so that / r-r e-^y*- 1 dy = 1 a > (9.51) 

Jo r(z) 

The function 

/(*/; a, 2) = -^j c ~ al/ 2/ z ~ 1 for ?/ > 

= for T/ ^ (9.52) 

/oo 
/(?/; a, 2) r/;/y = 1. We can look 

upon/(?/; a, 2) as a probability function. Its characteristic function is 





a* 



(a - i/)T(z) Jo (a - ?/)* (1 - i//a) 

Now let be a random variable with probability function 

P(*) = (~\ 



The probability function for f 2 is P(x) = ( l/V27ro:) e~ x/2 for x > 0, 
P(x) = otherwise (see Prob. 7, Sec. 9.6). If 1, 2, . . . , n are n inde- 
pendent random variables with the same probability function p(x) given 
above, the characteristic function for the random variable 



(9.54) 



is the product of the characteristic functions for each random variable t 2 , 
= 1,2, . . . , n [see (9.44)]. Now the characteristic function for P(x) is 

-x/2 Too .-(J-i 

<S** dx = 
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The characteristic function for x 2 is 

(1 - 2^)~ /2 (9.55) 



Comparing (9.55) with (9.53), we note that z = n/2, a = , so that the 
probability function associated with the random variable x 2 ^/(//; 1, w/2), 



= ?2) zr""<r"" for , > 

= for /y (9.,%) 

The distribution function for /'(,'/) is 

* ^" 2 - 1 * 1 -"" % (9-57) 



The distribution defined by K n (x) is called the x 2 distribution, principally 
associated with K. Pearson. 

The x 2 test of significance arises in the following manner: Let be a 
random variable with a known probability function p(x). Let us divide 
the interval <*> < x < oo into m parts, say, oc < x < x^ Xi ^ x ^ 
x^ . . . , m _i ^ x < oo . We have 



with > P t = 1. Let N be the number of times we sample , the result 

T = l 

of any sample being independent of the previous samples. The expected 
number of samples which fall in the iih interval is given by NP,. If r l is 
the actual number of samples falling in the ^th interval, then 



(r, - 



certainly constitutes some measure of discrepancy between the theoretical 
and experimental results. K. Pearson found that 



^- (9-58) 

yielded a practical means for measuring the reliability of the experi- 

m 

mental results. If we let y, = (r t - NP l )/\/WP % , then X 2 = X V*, and 
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m mm 

V I/, VP* = -4- V r* ~ VN V ^ = Vtf - VN = 0. Hence the 
LI VN LI LI 

t-i t=i t=i 

random variables ?/ 4 are not independent. The y % lie in the hyperplane 

m 

/ y% \^Pi = 0, a subspace of the ra-dimensional Euclidean space. It 

t-i 

can be shown that, as N becomes infinite (N is the total number of 
samples or trials), then the distribution of x 2 approaches K m ~\(x) [see 
(9.57)]. The reader is referred to Cramer, "Mathematical Methods of 
Statistics," Chap. 30, for proof of this statement. Thus 



lim P( x 2 ^ xS) = --- ----- x y (m ~ z ^e-v"dy (9.59) 

* 




One can determine the value of the integral of (9.59) from a table of the 
X 2 distribution. It is important to note that (9.59) is independent of 
the original probability function p(x). 

Example 9.19. A coin was tossed 5,000 times and heads appeared 2,512 times. 
Under the assumption that the coin is "true," we have p(H) = -, p(T) -%, m = 2, 
N = 5,000, Pi - P 2 - |; n = 2,512, r 2 = 2,488. Hence 



2 = 
* " 



(2,512 - 2,500) 2 (2,488 - 2,500) 2 

2^500 + " 2,500 

0.115 



In a table of the x 2 distribution it is found that the probability that x 2 exceed 3.841 
is 0.05. Since 0.115 < 3.841, we feel that there is no inconsistency in the assumption 
that the coin is true. The 5 per cent level is taken as a fairly significant level. In 
the tables one also finds that P(x 2 ^ 0.115) is about 0.73, which means that we have 
a probability of about 73 per cent of obtaining a deviation from the expected result at 
least as great as that actually observed. We are therefore not too worried about 
the fact that experimentally we obtained x 2 = 0.115. 

Problems 

1. A coin is tossed 5,000 times, with heads occurring 3,000 times. Evaluate x 2 for 
this experiment. Would you be suspicious that the coin is "true"? 

2. A die is rolled 6,000 times with the following occurrences: the number of times 1, 
2, 3, 4, 5, 6 occurred, respectively, was 1,020; 1,032; 981; 977; 1,011; 979. Show that 
X 2 = 2.876. What is your opinion as to the "truthness" of the die? 

3. A random variable has the probability function p(x) = (l/\/27r)e~ x * /2 . If in 
1,000 experiments one obtained 534 values of x < and 466 values of x ^ 0, would 
you consider that the experiment was biased? 

9.9. Monte Carlo Methods and the Theory of Games. The method of 
Monte Carlo is essentially a device for making use of probability theory 
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to approximate the solution of a mathematical or physical problem. A 
few examples will illustrate the method. 

Let p(x) be the probability function of a random variable, , defined 

on the range a ^ x ^ 6, with / p(x) dx = 1, p(x) = for x outside the 

Ja 

interval (a, fe). The expected, or mean, value of /(x) has been defined as 



/= f* f(x)p(x) dx 



A sequence of values xi, Xi, . . . , x n , is chosen at random subject to the 
condition that the probability that the random variable lie between x 
and x + dx be given by p(x) dx. For n large one hopes that 



(x>) (9.60) 

rb 

If we wish to find an approximate value of / F(x) dx, we note that 

Ja 



I" 

Ja 



(9.61) 



so that/(x) = F(x)/p(x). A simple choice for p(x) is p(x) = 1/(Z> a). 
The choice p(x) = I/ (6 a) implies that one can construct an experi- 
ment such that all numbers on the interval (a, 6) have equal likelihood of 
occurrence. It is extremely doubtful that such an experiment can be 
devised. The toss of a single coin can be used to generate the number 
zero if a head occurs, the number 1 if a tail occurs. An infinite number 
of ordered tosses of a single coin, or an infinite number of ordered coins 
tossed simultaneously, would yield a sequence ao, ai, 2, . . . , a n , . . . , 
with a n = or 1 for all n ^ 0. This, in turn, yields the number 



with ^ x ^ 1. Mathematicians have constructed tables of random 
numbers to avoid the cumbersome process of obtaining a random vari- 
able by experiment. Let the reader deduce a method for obtaining 

N 



by use of a bowl, containing an equal number of red arid white 
balls, and a scoop with N + 1 ordered holes. The author chose 25 num- 
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25 

bers from a table of random numbers and obtained -%-% \ %* = 0.5030, 

reasonably close to I x dx. 

We now consider a less trivial example. The game to be played con- 
cerns three spinners (see Fig. 9.7). If a pointer comes to rest in the area 






III 



designated by p, ; we move from the &th spinner to the jth spinner. If 
the pointer comes to rest in the area x % , i = 1, 2, 3, the game is over. 

3 

The areas are to be unity so that x l = 1 Y p l3 , i = 1, 2, 3, and the 

;-i 

p's and z's represent probabilities. We ask the following question: If 
we begin the game at spinner I, what is the probability, /n, that the 
game will end at spinner I? We can be successful as follows: 

1. Xi occurs on the first spin. 

2. pn occurs?! times in succession, and then Xi occurs, n = 1,2, 3, .... 

3. ^12 occurs followed by the probability, / 2 i, that if we are at spinner II 
the game will end at spinner I. 

4. pa occurs n times in succession, n ^ 1, and then (3) occurs. 

5. Replace pn and/ 2 i of (3) and (4) by p i3 and/ 3 i. 
Let the reader show that 

/ll = (Xi + pnXi + P 2 U XI +') + (P12/21 + P11P12/21 

+ PiiPi2/2i +') + (pia/ai + Pnpnfn + Pupnfai + ' ' ') 

X ~ 



1 - Pn 
By the reasoning used above one can readily show that 



i = 1, 2, 3 

(9.63) 
i= 1, 2, 3 
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From (9.63) one obtains 

(Pll 1)/12 + P12/22 + Pi 3/32 = 
P21/12 + (p22 - l)/22 + P23/32 = ~^ 2 
P31/12 + p,{2/22 + (P,I3 - l)/32 = 

If we solve for/i 2 , we have 
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Pl2 


Pl3 




Xz 


P22 - 1 


P2'l 


T. P] ~ 





P32 


P33 1 


_ * 2 P32 \ 


Pn - 1 


Pl2 


Pis 




P21 


P22 1 


P23 




PSI 


P32 


Pn 1 





Pl3 



so that /i2/#2 is the inverse element of 3021 in the matrix 



Pll - 

P21 
P31 



Pl2 

P22 

P32 



Pl3 

??23 
P33 - 



(see Sec. 1.3). Let the reader show that fzz/xz is the inverse element 
of p 2 2 for ||A||, etc. Thus probability theory can be applied to approxi- 
mate the elements of the inverse matrix of a given matrix. The quanti- 
ties f l} , i, j = 1, 2, 3, are obtained by experiment. 

An analogy exists between the diffusion process of heat motion and the 
two-dimensional random-walk problem. Suppose that a particle located 
at (x, y) moves to one of the four positions (x + 1, ?/), (x 1, y), (x, 
y + 1), (x, y 1), with equal probability, p = |. What is the proba- 
bility P(x, y, t) that a particle will arrive at (x, y) after t steps if it starts 
from the origin (0, 0) ? For a particle to arrive at (x, y) in t steps, it is 
obvious that it will have had to arrive at one of the four positions (x, 
y - 1), (x, y + 1), (x - 1, y), (x + 1, y) m t - 1 steps. Thus 

P(x,y;t) = ^P(x,y - 1;<- 1) +P(x,y + I;/- 1) 

+ P(x - 1, y; * - 1) + POr + 1, y; t - 1)J (9.64) 

Let us subtract P(x, y\ t 1) from both sides of (9.64). Then 
P(x,y\t) 



P(x,y;t - 1) 

l,i/;J- 1) - 2P(x,y;t - 1) 
+ %P(x, y+l]t-\) - 2P(x, y] t - 1) + 



ar - l,i/;f - 1)] 
- 1 ; t- 1)] (9.65) 



In Sec. 10.13 one notes that, if f"(x) is continuous f or a g x ^ />, then 

2h) - 2f(x + h) + f(x) 



f"(x) = linr 



(9.66) 



368 



ELEMENTS OF PURE AND APPLIED MATHEMATICS 



[see (10.22)]. In the calculus of finite differences the expression [f(x + 2) 
2f(x + 1) + f(x)] represents the second difference of /(#). Thus 



Af(x + 1) = /(* + 2) - f(x + 1) 



(9.67) 



2) - 



Thus (9.65) may be written 



AJP) 



which may be compared with the diffusion equation 



(9.68) 



(9.69) 



A very fine subdivision of space and time must be used, however, in order 
to freely interchange (9.68) and (9.69). One traces the histories of a 
large number of particles with initial lattice distributions to obtain the 
distributions after a given number of steps. These histories yield an 
approximate solution to (9.69). 

We conclude this section with a brief discussion of the theory of games. 
A simple example illustrates the basic factors encountered in the theory 
of games. We consider a two-person zero-sum game as exemplified by 
the square array given by (9.70). 



(9.70) 



X and Y choose a row and column, respectively, and the choice of each is 
made without any express knowledge of the other's choice. The number 
in the tth row and j'th column (the choices of X and F, respectively) 
designates the number of units that Y must pay X. If a tj < 0, then 
X pays Y an amount \a %J \. The total sum earned by X and Y is zero, 
and it is in this sense that we have a two-person zero-sum game. We 
assume that X will attempt to earn as much as possible and that Y will 
attempt to hold his losses to a minimum. We see that in this particular 
game the possible courses of action of X and F, the consequences of these 
actions, and the objectives of X and F are fully known. The theory of 
games seeks to analyze objectively the conflict of X and F and, moreover, 
attempts to determine an optimal course of action for each player. 



\ Y 
* \ 


1 


2 


1 


-6 


2 


2 


5 


4 
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The game given by (9.70) is very easily analyzed. If X chooses row 1, 
the least he can gain is 6. If he chooses row 2, the least he can gain is 
4 units. Since 4 > 6, it is obvious that X will always gain at least 
4 units by choosing row 2, no matter what the choice of F. Since Y 
knows that X will always choose row 2, of necessity, Y will always choose 
column 2 to hold his losses to a minimum. Since X and Y will always 
choose row 2 and column 2, respectively, with probability 1, we say that 
X and Y use pure strategies. 

Let us note the following : Let f(x, y) denote the quantity in row x and 

column y. The smallest number in row x is designated by rnin f(x, y). 

y 
The largest of these numbers is written 

U = max min f(x, y) (9.71) 

x y 

Thus U = min f(x Q , y) ^ min f(x, y) for all x. Let the reader deduce 

y y 

that the choice of x = XQ by X guarantees that X will always earn at 

least U units. Let us determine the maximum amount Y can possibly 
lose if he plays wisely whatever the action of X. The largest number in 
row y is designated by max f(x, y). The smallest of all such numbers 

X 

obtained by varying y is 

F = min max f(x, y) (9.72) 

y x 

Thus V = max f(x, yd) ^ max f(x, y) for all y. Let the reader deduce 

X X 

that the choice of y y$ will enable F to hold his loss to an amount at 
most V. For the square array given by (9.70) we have 

U = V = /(2, 2) = 4 

and it is because of this that X always chooses the second row and 
F always chooses the second column. Let us note that the element 
/(2, 2) = 4 is the minimum element of the second row and the maximum 
element of the second column. It is for this reason that /(2, 2) is called 
a saddle element. Let the reader deduce that the existence of a saddle 
element implies that U = V. In any case one always has U ^ V. The 
proof proceeds as follows: Let g(x) = min f(x, y), U = g(xd) = max g(x), 

y x 

h(y) = max/Cc, y), V = h(yd) = min h(y). Then 



U = g(xd) = max g(x) = max [min/(x, y)] = min/(z , y) 

g /(x , 2/ ) y ^ max/(x, y!) = h(y) = V (9.73) 

X 

The reader is urged to verify every step of (9.73). 
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We now consider a case for which U < V, given by (9.74). 



1 



-1 



2 



(9.74) 



Let the reader note the noncxistence of a saddle element with 

U = 2 < V = 3 

Thus X is sure to win 2 units per game if he always chooses x = 1, while 
Y can never lose more than 3 units if he always chooses y = 1. If Y 
were to notice, however, that X always chose x 1, then it would be 
expedient for Y to choose y = 2. Can it pay for X to change his strategy 
so that he can realize more than 2 units per game on the average? The 
answer is "yes"! Let us suppose that X and Y use mixed strategies in 
the following sense: X chooses x = 1 with probability p and chooses x 2 
with probability 1 p; Y chooses y = 1 with probability q and chooses 
y = 2 with probability 1 <? The total expectation of X is 

B = 3p<? + 2p(l - g) + (-1)0 - p)g + 4(1 - p)(l - q) 

= q(6p - 5) + 4 - 2p (9.75) 

X now reasons as follows: What mixed strategy should I use in order 
to guarantee a given expectation no matter what the mixed strategy 
employed by F? If ftp 5 > or p > |, then E is a minimum for 
q = 0, so that E = 4 2p < . On the other hand, if p < f, 7 is a 
minimum f or # = 1 so that E = 4p 1 < . For Op 5 = or p = f 
we have J = independent of q. Thus, no matter what the strategy 
of 7, X can be assured of earning of a unit per game by mixing his 
strategy. X chooses row 2 if a six occurs on a die and otherwise chooses 
row 1. Let the reader show that Y can choose a mixed strategy such 
that he will never lose more than - of a unit per game no matter what 
the strategy of X. 

Let us now consider the following game exemplified by (9.76), with 
a > 0, 6 > 0: 



(9.76) 
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Let pi y pz, 1 (pi + p 2 ) be the probabilities associated with the choices 
of x = 1, 2, 3, respectively, and let gi, g 2 , 1 (gi + #2) be the proba- 
bilities associated with the choices of y = 1, 2, 3, respectively. The 
expectation of X is 

E = pi(gi - g 2 ) + p2( g\ + (72) 

+ [1 - (Pi + P)][foi + <? 2 ) - 5(1 - (0! + g 2 ))] 
= ?I[PI - P2 + (a + 5)(1 - pi - p 2 )] 

+ gj-pi + p 2 + (a + 5)(1 - pi - p 2 )] - 5(1 - pi - p 2 ) (9.77) 

Let us examine E of (9.77). For p\ = p 2 = | we have E = Q inde- 
pendent of gi and g 2 . Thus a mixed strategy occurs for X such that 
X cannot lose or gain. Y has a good strategy (pure) with q\ = g 2 = 0, 
g 3 = 1 gi g 2 = 1. Why is it obvious that the only good strategy 
for X is the mixed strategy discussed above? However, we can show 
that good strategies exist for Y other than q\ g 2 = 0, ga = 1. We can 
write 

E = pj(l - a - B)qi -(!+ + )g + 5] 

+ P2[-(1 + + %i + (1 - a ~ 5)g 2 + 6] + (a + d)( qi + g 2 ) - 5 

It is obvious that E will be independent of p\ and p 2 if the coefficients 
of pi and pz are zero. The solutions of these two equations yield 
q\ = g 2 = d/2(a + 5), which in turn yields E so that Y can suffer 
no loss for this choice of qi and q%. Thus a good strategy which is mixed 
exists for Y. Let the reader show that, if gi = g 2 < d/2 (a + 6), then 
E = (1 pi p 2 )[2(a + 5)gi 6] ^ for all choices of pi and p 2 , so 
that an infinite number of good mixed strategies exist for Y. 

Problems 

1. Generalize the results of (9.63) for the case of n spinners. 

2. Let f(x) be a continuous rnonotomc increasing function defined for ^ x ^ 1. 
If a random variable be chosen from (0, 1), show that the probability that/() ^ y Q , 
/(O) ^ 3/0 ^ /(I) is given by p = f~ l (yo), so that/(p) = j/ . Use this method to find 
an approximate value of \/2 by use of a table of random numbers. 

3. Let us consider the plane of a two-dimensional random-walk problem with 
boundary points at (xi, ?A), i 1, 2, . . . , r. To each point (#, f/ t ) we associate a 
value U(xi, y % ). Let V(x, y) be the expected value of a particle whose motion starts 
at (x t y} and eventually ends at one of the boundary points. Show that 



with P t the probability that the walk terminates at (x vy i/ t ) if it begins at (x, y). Also 
show that 



1, y) + F(x - 1, y) + F(x, y + 1) + V(x, y - 1)] 

^ 2 F d*V 
and compare this result with -^ + - - 0. 
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4. Show that the game of "paper, rock, and scissor" is exemplified by the matrix 
below. What strategies should be used by X and F? 



\ Y 

* v ^ 
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CHAPTER 10 

REAL-VARIABLE THEORY 



10.1. Introduction. We desire to preface the study of the real-number 
system by first introducing a brief view of our philosophy of mathematics 
and science. Historians are generally in agreement that in many ways 
Greek mathematics was the precursor to modern mathematics. One 
begins the study of geometry with a set of postulates or axioms along 
with some definitions of the fundamental entities in which one is inter- 
ested. From this beginning one deduces through the use of Aristotelian 
logic the many theorems concerning triangles, circles, etc. To the novice 
the axioms appear to be self-evident truths. We do not hold to this view : 
a postulate is nothing more than a man-invented rule of the game, the 
game in this case being mathematics. Actually the mathematician is 
not interested in the truth, per se, of his postulates. The postulates of 
a mathematical system can be chosen arbitrarily subject to the condition 
that they be consistent with each other. The proof of the self-consistency 
of the postulates is no easy task, if, indeed, such proofs exist. A dis- 
cussion of such matters lies beyond the scope of this text. 

Finally, we should like to add that any theorems deducible from the 
postulates are a consequence of the postulates themselves. We admit 
this is a trivial and obvious remark. Too often, however, the scientist 
forgets this fact. He is prone to believe that the laws of nature are dis- 
covered. It is our personal belief that this is not the case. We lean 
toward the more esoteric point of view that the so-called laws of nature 
are invented by man. Newton described the motion of the planet 
Mercury relative to the sun in a fairly accurate manner by use of f = ma 
and f = (GmM/r*)r. Because of this one feels that these two equa- 
tions are laws of nature discovered by Newton. On the other hand, by 
postulating that a gravitational point mass yields a four-dimensional 
Riemannian space and that a particle moves along a geodesic of this 
space, Einstein also obtained the motion of Mercury relative to the sun. 
It is well known that Einstein's predicted motion of Mercury is more 
accurate than Newton's predicted motion. Are we to assume that from 
the time of Newton to Einstein we had a "true" law of nature and that 
now we discard this law and accept Einstein's theory as the truth? It is 
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just a matter of time before a physicist will invent a new set of postulates 
which will explain more fully the physical results obtained by experiment. 
Moreover it is not our belief that we continually approach the true laws 
of nature. 

10.2. The Positive Integers. We begin a study of the real-number sys- 
tem by first discussing the positive integers from a postulational point of 
view. The positive integers will be undefined in the sense that they can 
be anything the reader desires subject to the condition that they satisfy 
the postulates set forth below. We shall attempt to be rigorous, but 
shall not be as rigorous as might be possible. First we do not feel quali- 
fied to attempt this task, and second it would require a treatise in itself 
to elaborate on the logical and philosophical aspects and implications of 
the material upon which we shall discourse 

The positive integers have been characterized by Peano through the 
following postulates: 

TV- There exists a positive integer, called one, and written 1. 
P b : Every positive integer x has a unique successor x f . 
P c : There is no positive integer x such that x' = 1. 
Pd'. If x' = ?/, then x = y. 

P e : Let 8 be any collection of positive integers. If S contains the 
integer 1, and if, moreover, any integer x of 8 implies that x' is in S, then 
S is the complete collection of integers, /. 

The postulate P a guarantees that there is at least one member of our 
set of positive integers so that we may all agree that there is something 
we can talk about. The equal sign ( = ) involved in the relation x = y 
of Pd means that x and y are identical elements in the sense that we may 
replace x by y or y by x in any operation concerning these integers. The 
postulate P e differs in some respects from the other postulates. P e is a 
rule for determining when we have the complete set of integers at hand, 
and is used chiefly m deducing new laws which the integers obey. 

As postulated above, the integers could be any sequence (0,1, a*, . . . , 
a n , . . . ) which the reader has encountered in calculus courses. It is in 
this sense that the integers are undefined. It is obvious that more will 
have to be said before we can identify the collection of integers defined 
above with the ordinary integers of our everyday working world. We 
shall accept as intuitive the notion and idea of a set, or collection, of 
objects. The Peano postulates will make themselves clearer to the stu- 
dent as he proceeds to read the text. 

The operation of addition (+) relative to the positive integers is defined 
by the postulates 

A a : a' = a + 1 

A b : a + V = (a + &)' 

The postulate Ab implies that a + b is an integer (closure property). 
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We are now in a position to deduce some new results concerning the 
positive integers. 

THEOREM 10.1. Addition is associative (a + 6) + c a + (b + c). 
We let S be the collection of integers x,y, z, . . . which satisfy Theorem 
10.1 for all a and b. Thus x is in S if (a + 6) + x = a + (b + x) for all 
integers a and b. We first show that 1 is an element of S. Now 

(a + 6) + 1 = (a + bY from A a 

= a + V from A b 

= a + (b + 1) from A a 

so that 1 belongs to R. Now let a* be any element of S. This means 
that (a + b} + x = a + (b + x) for all a and 6. Now 

(a + 6) + x 7 = [(a + 6) + x] 1 from 4 fc 

= [a + (b + #)]' since x is in /S 

= a + (b + x)' from A a 

= a + (b + x') from ^1 6 

so that x' belongs to S whenever x is in S. From P $ is the complete 
set of integers, S s= /. Q.E.D. 

THEOREM 10.2. Addition is commutative a + b = b + a (see Prob. 1). 

THEOREM 10.3. Law of Cancellation. If a + b = a + c, then /> = c. 
We proceed by induction on a. First, if 

1 + b = 1 + c 

then 6 + 1 = c + 1 from Theorem 10.2 

6' = c' from A a 

b = c from P d 

Hence 1 belongs to S. Now let x be any element of S. This means that 
whenever x + b = x + c then b c. Now assume x f + 6 = #' + c. 
Then 

6 + a:' = c + #' from Theorem 10.2 

(b + 3)' = ( c + x)' from ,4 b 
b + x = c + x from P d 

x + 6 = x + c from Theorem 10.2 

6 == c since x is in /S 

Thus x' belongs to S whenever x belongs to S. From P e , S s= /. Q.E.D. 

THEOREM 10.4. If b + a = c + a, then b = c (See Prob. 2). 

In what follows we shall use the symbol (=>) to mean "implies." 
Thus Theorem 10.4 may be written & + a = c + a=>6 = c. Implica- 
tion in both directions will be represented by (<=). Thus 

b + a = c + a <=>& = c 
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The reader should realize that b = c=*b + a = c + a does not mean 
that we have introduced the axiom about adding equals to equals. All 
we have done to the expression b + a is to replace b by c (permissible 
from the definition of equality of two integers). 

Equality ( = ) belongs to the class of relations, R, called equivalence 
relations. We note that 

a = a 

a = b <=$ b = a 
a = b,b = c=*a c 

In general a relation R is called an equivalence relation relative to 
a set of elements (a, b, c, . . . ) if 

R\\ aRa reflexive 

R% : aRb^bRa symmetric 

R- A : aRb, bRc =^> aRc transitive 

An equivalence relation R enables one to distinguish between various 
classes of elements. For example, we say that aRb holds if and only 
if a and b yield the same remainder when divided by 2. Thus any integer 
belongs either to the class of even integers or to the class of odd integers. 
The reader should check that the elements of the class of even integers 
satisfy R^ R^ R-*. We write aRb <=> a = b mod 2. 

Now we introduce multiplication by the two postulates given below: 

M a - a - 1 a 

Mb' a - b f a b + a 

Mb assumes that a - b is an integer for all a and b. We shall omit the dot 
whenever it is convenient to do so. Thus Mb can be written ab f = ab + a. 
THEOREM 10.5. Multiplication is left-distributive with respect to addi- 
tion, a(b + c) = ab + ac. We prove this theorem by induction on c. 
Let S be the collection of integers (x, y, z, . . . ) such that 

aft + x) = ab + ax 
for x in S and for all a and b. First 

a(b + 1) = ab' = afr + a = a?> + a-l 
from A , M b . Thus 1 is in S. Now let x be in S. We have 

a(b + x') = a(b + x) f from A b 

= a(b + x) + a from Mb 

= (ab + ax) + a since x is in S 

ab + (ax + a) from Theorem 10.1 

= ab 4- ax' from Mb 

so that x' is in S if x is in S. Thus S = I. 
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THEOREM 10.6. 1 a = a (see Prob. 3). 

THEOREM 10.7. Multiplication is associative, (ab)c = a(bc) (see Prob. 
4). 

THEOREM 10.8. Multiplication is commutative, ab = ba (see Prob. 5). 

THEOREM 10.9. Multiplication is right-commutative, 

(b + c)a = ba + ca 

(see Prob. 6). 

DEFINITION 10.1. If a b + c, we say that a is greater than b or 
that b is less than a, written a > b or b < a, respectively. We can state 
four important ordering theorems regarding inequalities: 

O a '. a, b given =>a = 6ora>6ora<6 

Oi\ a>b,b>c=*a>c 

O c : a>b=>a + c>b + c 

O d : a > b =* ac > be 

We leave these theorems as exercises for the reader (see Probs. 7 and 12). 
A set obeying O a is said to be totally ordered. 

To place the integers in the realm of our everyday working experiences, 
we now postulate that the integer 1 , which occurred in the first of Peaiio's 
postulates, P , will be the adjective which reasonable, sane, prudent 
human beings attach to single entities when they speak intelligently of 
one book, one day, one world, etc. Of course the number 1 as given in 
Peano's postulates is a noun, not an adjective. B. Russell associates 
"one" with the class of all single entities. Trouble occurs, however, 
when one tries to define a class of elements. 

The successor of one, namely, 1' = 1 + 1, will be called the integer 
two, written 2, and addition will again mean what we wish it to mean, 
namely, that one book plus one book implies two books. We now feel 
free to speak of the class of positive integers 7(1, 2, 3, . . . , n, . . . ). 

Problems 

. 1. Prove Theorem 10.2. Hint: First prove that a -f- 1 = 1 -f a for all a. 

2. Prove Theorem 10.4. 

3. Prove Theorem 10.6. 

4. Prove Theorem 10.7. 
6. Prove Theorem 10.8. 

6. Prove Theorem 10.9. 

7. Prove ordering theorems Ot>, O c , O d . 

8. Prove that 1 a = a. 

9. Prove that b'a ba -f- a. 

10. Show that 1 ^ all integers a. Hint: Let S be the class of all integers x such that 
x ^ 1. Then show that S = /. 

11. If a' > b, show that a > b or a = 6. 

12. Using the results of Probs. 10, 11, prove O a . 
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13. Prove that every nonempty set of integers contains a smallest integer. First 
define what is meant by a smallest integer of a collection of integers. Also use the 
results of Probs. 10, 11. 

10.3. The Rational Integers. An extension of the positive integers 
which includes the zero element and the negative integers is obtained in 
the following manner: Given any pair of positive integers, we postulate 
the existence of a difference function d which operates on the positive 
integers a, b subject to the condition that 

d(a, b) = d(.r, //) (10.1) 

if and only if a + y b + x. Equality as defined by (10.1 ) certainly is 
an equivalence relation. We note that the above definition depends only 
on addition of positive integers (Sec. 10.2). 

Let us see what properties are possessed by d. Obviously 

d(a, a) = d(fe, b) 

Moreover d(a + c, b + c) = d(a, b). If we desire that d(a, b) behave 
like b a of ordinary arithmetic, of necessity we are forced to postulate 
that addition be defined by 

d(a, b) + dfcr, y) = d(<i + x, b + y) (10.2) 

We now note that 

d(a, 6) + d(c, c) = rf(a + c, b + c) = d(a, b) (10.3) 

from (10.2) arid (10.1). Hence d(c, c) has the property of being a zero 
element for the operation of addition ( + ). 

We define multiplication by the following rule: 

d(a, b)d(x, y} = d(ay + bx, by + ax) = d(x, y)d(a, b) (10.4) 
What of the number d(a, a + 1)? We see that 

d(x, y)d(a, a + 1)-= d(xa + x + ya, ax + ya + y) ( . 
d(x, y) = d(a, a + l)d(x, y) (L ^ } 

so that d(a, a + 1) has the unit property as regards multiplication. 

It now can easily be shown that the collection of elements d(l, x + 1) 
when x ranges over the set of positive integers will satisfy all the Peano 
postulates outlined in Sec. 10.2. The set of numbers d(a, b) contains the 
positive integers as a subclass and in its totality represents the class 
of rational integers (positive integers, negative integers, and the zero 
element). We let the reader show that the zero element d(a, a) has the 
properties 

d(a, a)d(x, y) = d(a, a) nn . 

d(x, y) + d(y, x) = d(a, a) uu ' b; 
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We say that d(y, x) is the negative of d(x, ?/), and conversely. We may, 
if we please, write d(a, a) = 0, d(l, a + 1) = a, d(a + 1, 1) = a, 
d(a, 6) = 6 a. The ordering postulates can easily be extended to 
cover the new collection d(a, 6). We say that 

d(o, 6) > d(x, y) 
if and only if 

b + x > a + y 
Let the reader show that 

d(a, b) > d(x, y) => d(a, 6) + d(u, v) > d(x, y) + d(u, v) 

d(a, b) > d(x, y} => d(a, 6)d(z, w + z) > d(x, y)d(z, u + z) (10.7) 

d(a, 6) > d(x, i/) => rf(x, y)d(u + z, z) > d(a, b)d(u + z, z) 

What is the ordinary meaning of (10.7)? 

Let us note some properties of the rational integers as outlined above. 
First we have a set of elements called the rational integers. Associated 
with this set of elements are two operations called addition and multipli- 
cation. The elements satisfy the following properties relative to the 
operations defined above: 

1. Closure properties: a + b, ab arc elements of our set when a, b are 
elements of the set. 

2. Associative laws: a + (b + c) = (a + 6) + c, a(bc) = (ab)c. 

3. Unit elements: a + = a = + a, a-1 = = 1 a. 

4. Distributive laws: a(b + c) = ab + ac, (a + b)c = ac + be. 

5. An inverse element exists for each element relative to addition, 
that is, a + ( a) = ( a) + a = for all a. 

We say that the rational integers form a ring with unit element 1. We 
also note that, if ab = 0, then a = or b = 0. Why? 

We give an example of a ring such that ab = does not imply a = 
or b = 0. Such rings are said to have zero divisors. Let us consider the 
six following classes: Into class one we place all integers which yield a 
remainder 1 upon division by 6. Class two contains all integers which 
yield a remainder 2 upon division by 6. Class y has the obvious property 
that it consists of those integers which yield a remainder j upon division 
by 6. We represent these various classes by the symbols 1, 2, 3, 4, 5, 0. 
Addition and multiplication are defined in the ordinary way. Thus 

2 5 = 10 = 4 mod 6, since 10 yields a remainder of 4 when divided by 6. 

3 + 4 = 7 = 1 mod 6, etc. It is obvious that 2-3=0 mod 6, but 
neither 2=0 mod 6 nor 3=0 mod 6. We have constructed the ring of 
integers modulo 6. 

10.4. The Rational Numbers. We can extend the ring of rational 
integers by postulating the existence of a rational function /?(a, b) where 
a, 6 are any two rational integers, a ^ 0. We postulate that R is to 
have the following properties: 
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R(a, b) = R(x, y) *=> bx = ay 

R(a, b) + R(x, y) = R(ax, bx + ay) 

R(a, b)R(x, y) = R(ax, by) (10.8) 

R(a, b) > R(x, y) <^bx > ay if ax > 

R(a, b) > R(x, y) *=* bx < ay if ax < 

We leave it to the reader to show that the set of elements 72(1, b) 
has all the properties of the set of rational integers b. For all practical 
considerations we may wiite 7?(1, b) == ?>, and we shall obviously write 
R(a, b) = b/a. The set of rationals has a further property not possessed 
by the ring of rational integers. For every R(a, b), b ^ 0, a ^ 0, there 
exists a rational R(b, a) such that 

7?(a, b)R(b, a) = R(ab, bo) = /?(!, 1) = 1 

Thus every nonzero element has an inverse with respect to multiplication. 
We say that the rationals form a field. 

The reader should have no trouble showing that the integers modulo 7 
form a field. The inverse of 2 is 4 since 2-4 = 8 = 1 mod 7. 

The rationals form a totally ordered field. We feel, however, that 
the rationals are not complete in the sense that we should like to deal 
with numbers like \/2, e, w, etc. For the irrationality of \/2> see 
Sec. 10.5. We shall have to extend the field of rationals. This will be 
done in Sec. 10.6. 

10.6. Some Theorems Concerning the Integers 

THEOREM 10.10. Let /, g be positive integers. There exist integers 
h, r such that / = gh + r, ^ r < g. Our proof is by induction on /. 
Let/ = 1. Then 

1 = 1.1+0 if g = 1 so that h = 1, r = 
1 = 0-0 + 1 if g > 1 

Now assume Theorem 10.10 holds for /. Then 

/ = gh + r ^ r < g 

and /+ 1 = gh+ (r + 1) 

Since r < g, we have r + 1 < g or r + 1 = g. If r + I < g, the theo- 
rem holds. If r + 1 = g, then/ + 1 = g(h + I) + and Theorem 10.10 
holds. From P e , the theorem is true for all integers /. We leave it as 
an exercise to show that the ft, r of Theorem 10.10 are unique. 

THEOREM 10.11. We now derive a theorem regarding the greatest 
common divisor of two integers a, b. Consider the collection of integers 
(>0) of the form ax + by > 0. Since every collection of positive inte- 
gers has a least integer, there exist integers xo, 2/0 such that 

ax + fa/o = d > 



REAL-VARIABLE THEORY 381 

is the least integer of our collection. We now show that d is the greatest 
common divisor. First assume that d does not divide a. Then 
a = ds + r,Q<r<d from Theorem 10.10. Thus 

ax s + by s = ds = a r 

and (1 Xos)a + ( y s)b = r, a contradiction since r < d, unless r = 0, 
in which case d divides a, d/a. Similarly d/b. Now any other divisor, 
d\, of a and b must divide ax Q + by Q (why?), and hence di/d. Therefore 
d is the greatest common divisor of a and 6, written d = (a, b). As an 
immediate corollary, if a and b are relatively prime, d = 1 and there 
exist integers x, y such that ax + by = 1. 

THEOREM 10.12. If (a, 6) = 1 and a/be, then a/c. 

Proof, ax + by I since (a, b) = 1, so that axe + fa/c = c. But 
6c = ad since a/be. Hence a(xe + yd) = c, which in turn implies that 
a/c. 

THEOREM 10.13. The Fundamental Theorem of Arithmetic. An integer 
N can be written uniquely as a product of primes. 

Proof. Assume 

N = p^ - - p r = ?i02 q 8 



It is obvious that we can consider the p t , q } as primes since a nonprime 
factor can be broken into further factors. Continuing this process 
eventually reduces all factors of N into primes. Furthermore this can 
}>e done only a finite number of times since all primes are greater than 
or equal to 2. From above p\/q\qi ' ' ' q&. If (pi, #1) ^ l,thenpi = q^ 
Why? If (pi, qi) = 1, then pi/q 2 q* ' ' q s from Theorem 10.12. We 
continue this process until eventually p\/q^ for an i = 1, 2, . . . , s. 
Since </ t is assumed prime, we must have p^ = q t . We now cancel these 
equal factors and continue our reasoning in the same manner. In this 
way we obviously exhaust all the p % and #,. Thus, except for the order, 
the number N is factored uniquely as a product of primes. 

THEOREM 10.14. The \/2 is irrational. 

Proof. Assume \/2 rational so that \/2 = p/q. From Theorem 
10.13 we have p = pip z ' ' p r , q = q\q% ' ' ' <?, with the p t , g, unique. 
Thus upon squaring we have 

N = 2qiqiq 2 q2 ' ' ' q s q* = Pipip^p* ' ' ' p r p r 

Since the prime factor 2 must occur an odd number of times on the left 
and can occur only an even number of times on the right, a contradiction 
occurs. Q.E.D. 

Problems 

1. Show that the collection of elements d(a t b) satisfies the Peano postulates as the 
elements a, 6 range over the Peano integers. 
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2. Trove (10.5). 

3. Prove (10.6). 

4. Show that the set of elements R(l, b) has all the properties of the set of rational 
integers 6. 

6. Show that the integers modulo 7 form a field. 

6. Show that \/p is irrational for p prime. 

7. Show that \/2 4- \/3 is irrational. 

8. Consider the set of polynomials with rational coefficients. Deduce theorems for 
these polynomials analogous to the theorems of Sec. 10.5. 

10.6. An Extension of the Rationals. The Real-number System. 

The rationales obey the two following rules: 

1. They form a field. 

2. They are totally ordered. 

On the other hand, the rationale are not complete. Pythagoras real- 
ized that there existed so-called numbers, \/2> e ^c., which are not rational 
numbers. If we wish to include the \/2 into our number system, we 
must enlarge the field of rationals. R. Dedekind gave a fairly satisfac- 
tory treatment of irrationals. We discuss an extension of the rationals 
which differs little from Dedekind's approach and which is due to 
B Russell. 

DEFINITION 10.2. A RusseM number is a set, ft, of rationals having 
the following properties. 

1 . R contains only rationals and is nonempty. 

2. R does not contain all the rationals. 

3. If a is a rational in R and if ft is a rational less than a, then ft is in /?. 
Two examples of Russell numbers are given as follows: 

1. The set of all rationals ^1 is a Russell number. It will turn out 
to be the unit of our constructed field of real numbers. 

2. The negative rationals and all positive rationals whose squares are 
less than 2 form a Russell set or number. We shall designate it by the 
symbol \/2. 

DEFINITION 10.3. Two Russell numbers are said to be equal or equiva- 
lent if and only if every rational of Ri is contained in 7 2 , and conversely, 
with one possible exception. 

The set of all rationals less than 2, which defines the Russell number, 
two, is equivalent to the Russell number consisting of all rationals less 
than or equal to 2. The only member of the second Russell number 
not found in the first Russell number is the rational two. 

DEFINITION 10.4. The Russell number Ri is said to be greater than 
the Russell number R^ if, and only if, at least two rationals of Ri are not 
members of R 2. 

We note that, given Ri and R 2y only one of Ri = R^ Ri > R 2 , R l < R 2 
can occur. 

We must now define addition of two R numbers in such a way that the 
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result will yield an R number. Let R\ and ft 2 be two R numbers. Con- 
struct the set of rationals obtained by adding in all possible ways the 
rationals of Ri with those of ft 2 . We leave it to the reader to show that 
this new set of rationals defines an R number. What property will the 
set of all rationals less than zero have? Is this Russell number needed 
for a field? Does every R number have a negative? The student should 
readily answer these questions. 

It is a bit more difficult to define multiplication. We shall define 
multiplication for two Russell numbers. ft i, R% which are greater than 
the zero Russell number. We first delete the negatives of fti and R 2 . 
Then we multiply the positive rationals of R \ with the positive rationals 
of Ri in all possible ways. To this acquired set we adjoin the negative 
rationals. We leave it to the reader to show that the newly acquired 
set is a Russell number. An equivalent definition would be the following: 
Let C(Ri), the complement of fti, be the set of rationals not in fti. Show 
that 



is a Russell number. By C(Ri) - C(R%) we mean the complete set of 
rationals obtained by the product of rationals of C(R\) with rationals of 
C(Rz) in all possible ways. We leave it to the reader to extend the 
definition for the other cases of R} and 7? 2 . 

It is now an easy problem to show that the complete set of Russell 
numbers form a totally ordered field which contains the rationals as a 
subfield. Everything hinges essentially on the fact that we reduce our 
computations to the field of rationals. Let the reader show that the set 
of Russell numbers forms a totally ordered field. 

The field of Russell numbers has an additional property not possessed 
by the rationals. To exhibit this property, we first define the supremum 
(least upper bound) of a set of numbers (Russell). 

DEFINITION 10.5. The number s is said to be the supremum of a set of 
numbers S: (x, y, z, . . .) if and only if: 

1. s ^ x for all x in S. 

2. If t < s, there exists an element y of S such that y > t. We leave 
it to the reader to show that 1 is the supremum of the set of all rationals 
less than 1. Also, \/2 is the supremum of the set of all rationals whose 
squares are less than 2. Let the reader show that a set of numbers can- 
not have two distinct suprema. Let the reader also define the infemum 
(greatest lower bound) of a set. 

DEFINITION 10.6. A set, S, of Russell numbers is bounded above if 
a Russell number, N, exists such that N > x for all x in S. 

THEOREM 10.15. A supremum s exists for every set of numbers which 
has an upper bound. 
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Our proof is as follows: Let S be a set of Russell numbers bounded 
above by N. We construct a new Russell number as follows : Let s con- 
sist of all rationals which belong to any Russell number of S. We first 
show that s is a Russell number. 

1. s obviously contains only rationals and is not the empty set. 

2. Since N contains rationals not in any of the Russell numbers of S, 
there are rationals not in s. 

3. Let a be a rational of s and ft a rational <a. Since a belongs to a 
Russell number, say, Si of S, then ft also is in Si (why?), so that ft belongs 
to s from the definition of s. 

Thus s is a Russell number (see Definition 10.5). We next demon- 
strate that s is the supremum of the set S. 

1. s ^ all numbers of S, for otherwise there is at least one element, 
Si, of S such that Si > s. This implies that Si contains a rational not 
in s, which is impossible from the construction of s. 

2. Let r be a Russell number less than s, and let ft, y be elements of 
8 not in r. This is possible since s > r. Now ft, 7 belong to at least 
one set, Si, of S from the construction of 8. It is easy to see that Si > r. 

From Definition 10.5, s is the supremum of the set S. 
The set of Russell numbers thus 

1. Forms a field. 
(A) 2. Is totally ordered. 

3. Satisfies the property that a supremum exists for every set of 
numbers bounded above. 

Any set of elements satisfying the three properties of (A) is unique 
in the sense that they differ from the Russell numbers in name only. 
Thus the real-number system as set forth above is unique as regards its 
construction. 

To show this uniqueness, we let S and S be two totally ordered fields 
with the supremum property. We first match the zero elements and 
the unit elements. 

0<->0 



Then we match the rational integers, 



n 



Then we match the rationals, 
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Now consider any element, s, of S which has not as yet been put into 
one-to-one correspondence with an element of 8. We consider the set 
of all rationals of S less than s. It is obvious that s is the supremum 
of this set. The rationals of S which correspond to the above-mentioned 
set of rationals of S will be bounded above. Why? A supremum S 
will exist for this set. We match s <- s. Thus every member of 8 can 
be mapped into a number of S. We leave it to the reader to show that 
every number of S is exhausted (mapped completely) by this method. 
Under the above mapping we realize that 



implies 



a + b -> (JT+~B) = a + b 
ab <~> (ab) db 



Such a correspondence, or mapping, is called an isomorphism, and it is in 
this sense that we say the sets 8 and S are equivalent. 

We can now prove the Archimidean ordering postulate by use of the 
supremum. Let r > 0, and consider the sequence of numbers 

r, 2r, 3r, . . . , nr, . . . 

We maintain that there is an integer m such that mr > 1. If this were 
not so, the sequence constructed above would be bounded above and so a 
supremum s would exist for the sequence. From the properties of s we 
have s ^ nr for all n. Now consider t = s r < s. An element pr 
exists such that pr > t = s r. Thus pr + r > s or (p + l)r > s, a 
contradiction. 

We list the Archimidean ordering postulates. Any one of the postu- 
lates implies the existence of the others. 

AOi'. r > => 3 integer n =5 nr > 1. (3 == there exists, =3 == such 
that.) 

AO*: a > ft > = 3 rational r 3 a > r > ft. 
A0 3 : a > = 3 rational r 3 a > r > 0. 

A0 4 : (Eudoxus) If for every rational r > ft it follows that r > a, 
then ft ^ a. 

We now prove A0 2 from AOi. If a > ft > 0, then a > 0. From 
AOi an integer n exists such that n(a ft) > I or na > nft + 1. Now 
/3 > so that an integer m exists such that m > ftn. Let m be the small- 
est of all such integers. Thus m > ftn ^ m 1, Hence 

na > nft + 1 ^ m > nft 
so that 

.>=>, 
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Problems 

1. If b is a Russell number 5^ 0, how does one find the number b' 1 such that 
Mri = b- l b = 1? 

2. Extend the definition of multiplication of two Russell numbers for the cases for 
which both numbers are not positive. 

3. Show that the Russell numbers form a field. 

4. Show from the definition of the supremum that it is unique for a set of elements. 
6. Define the infomum of a set. 

6. Prove AO* from AO^ A0 4 from AOs, AOi from A0 4 . 

10.7. Point-set Theory. For convenience, we shall consider only 
points or elements of the real-number system in what follows. Any set 
of real numbers will be called a linear set. The field of real numbers can 
be put into one-to-one correspondence with the points of a straight line in 
the usual fashion encountered in analytic geometry and the calculus. All 
the definitions and theorems here proved for linear sets can easily be 
extended to finite-dimensional spaces. 

DEFINITION 10.7. The set of points \x\ satisfying a ^ x g &, a, b 
finite, will be called a closed interval. If we omit the end points, that is, 
consider those x which satisfy a < x < b, we say that the interval is 
open (open at both ends). For example, ^ x ^ 1 is a closed interval, 
while < x < 1 is an open interval. 

DEFINITION 10.8. A linear set of points will be said to be bounded 
if there exists an open interval containing the set. It must be emphasized 
that the ends of the interval are to be finite numbers, thus excluding 
oo and + oo . 

An alternative definition would be the following : A set S of real num- 
bers is bounded if there exists a finite number N such that N<x<N 
for all x in S. 

The set of numbers whose squares are less than 3 is certainly bounded, 
for if x 2 < 3, then obviously 2 < x < 2. However, the set of numbers 
whose cubes are less than 3 i unbounded, for # 3 < 3 is at least satisfied 
by all the negative numbers. This set, however, is bounded above. By 
this we mean that there exists a finite number N such that x < N if 
.r 3 < 3. Certainly N = 2 does the trick. Generally speaking, a set S 
of elements is bounded above if a finite number N exists such that x < N 
for all x in S. Let the reader frame a definition for sets bounded 
below. 

We shall, in the main, be concerned with sets which contain an infinite 
number of distinct points. The rational numbers in the interval < x 
< I form such a collection. 

DEFINITION 10.9. Limit Point. A point p will be called a limit point 
of a set S if every open interval containing p contains an infinite number 
of distinct elements of S. 
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For example, let S be the sequence of numbers 



/iii i \ 

V r 2' 3' 4' ' ' ' ' n ) 



It is easy to verify that any open interval containing the origin con- 
tains an infinite number of elements of S. Thus is a limit point of 
the set S. Note that in this case the limit point does not belong to S. 
It is also at once apparent that a set S containing only a finite number of 
points cannot have a limit point. 

Let the reader show that a point q is not a limit point of a set S if 
at least one open interval containing q exists such that no points of S 
(except possibly q itself) are in this open interval. 

DEFINITION 10.10. Neighborhood. A neighborhood of a point is any 
open interval containing that point. A deleted neighborhood N p of p is 
a set of points belonging to a neighborhood of p with the point p removed. 
The set of points x such that ^ < x < T&, x ^ -nnr, is a deleted neighbor- 
hood of x = -nnr. 

The definition of a limit point can be reframed to read: p is a limit 
point of a set S if every deleted neighborhood of p contains at least one 
point of S. 

DEFINITION 10.1 1. Interior Point. A point p is said to be an interior 
point of a set S if a neighborhood N p of p exists such that every element 
of N p belongs to S. Does p belong to S? If S is the set of points ^ x 
^ 1, then and 1 are not interior points since every neighborhood of 
or 1 contains points that are not in S. All other points of this set, how- 
ever, are interior points. 

DEFINITION 10.12. Boundary Point. A point p is a boundary point of 
a set S if every neighborhood of p contains points in S and points not in S. 
If S is the set ^ x ^ 1 , then and 1 are the only boundary points. A 
boundary point need not belong to the set. 1 is the only boundary point 
of the set S which consists of points x such that x > 1. 

DEFINITION 10.13. Exterior Point. A point p is an exterior point of 
a set S if it is not an interior or boundary point of S. 

DEFINITION 10.14. Complement of a Set. The complement of a set S 
is the set of points not in S. The complement C(S) has a relative mean- 
ing, for it depends on the set T in which 8 is embedded. If S y for example, 
is the set of real numbers 1 ^ x ^ 1, then the complement of S relative 
to the real-number system is the set of points \x\ > 1. The complement 
of 1 x ^ 1 relative to the set 1 ^ x ^ 1 is the null set (no ele- 
ments). The complement of the set of rationals relative to the reals is 
the set of irrationals, and conversely. 

DEFINITION 10.15. Open Set. A set S is said to be open (not to be 
confused with open interval) if every point of S is an interior point of S. 
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For example, the set of points {x\ which satisfies either < x < 1 or 
6 < x < 8 is an open set. 

DEFINITION 10.16. Closed Set. A set which contains all its limit 
points is called a closed set. For example, the set 



/ 

V 



2' r 4' 




S l (JS 2 



is closed since its only limit point is 0, which it contains. 

DEFINITION 10.17. The set of all points (or elements) which belong 

to Si or 82 is called the union (Si VJ $ 2 ) or sum (Si + 2 ) of the two sets 

Si, 82. The shaded area below is 
Si + S 2 (see Fig. 10.1). The union, 

S = VJ S a , of any number of sets, 

() 

{S a \, is the set of all points {x}, x 
in at least one of the S a . 

DEFINITION 10.18. The set of 
all points which belong to both Si 
and /S>2 simultaneously is called the 
intersection, or product, written 
5i n S 2 or Si - 82, of the two sets 
Siy S%. Graphically, the shaded 
area is Si C\ Sz. 

The theory of sets and its appli- 
cation to logic and mathematics 
was given great impetus by the Eng- 
lish mathematician, George Boole 
(1815-1864). 

Two sets Si, 82 are said to be equal , 
81 = 82, if every element of >S\ 

belongs to Sz, and conversely. If Si contains every element of $ 2 , we say 
that 82 is a subset of Si, written Si D S%. If 82 is a subset of Si and if, 
furthermore, at least one element of Si is not in 8%, we say that 82 is a 
proper subset of Si, written 

Si D 8, or S 2 C Si 

Some obvious facts of set theory are: 

(1) A + B = B + A 

(2) (A+B)+C = A + (B + C) 

(3) AB = BA 

(4) (AB)C = A(BC) 

(5) A + A = A, AA = A 

(6) A C A + B, AB C A, AB C B 




FIG 10.1 
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(7) A C C, B CC=*A +B CC 

(8) A^C,B2C=*ABD<r 

(9) AB (I A(B + C) 

(10) A(B + C) = AB + AC 

We proceed to prove (10). Let x be any element of A(B + C). Then 
x belongs to A and to either B or C. Thus x belongs to either AB or AC 
so that x is an element of AB + AC. Thus A(J5 + C) (I ,4 + AC. 
Conversely, AB (I A( + 0), AC C A(# + C) from (9")7 From (7), 
AB + AC C A (5 + C), so that A(B + C) = AB + AC. 

Problems 

1. What are the limit points of the set ^ x ^ 1? Is the set closed, open? What 
are the boundary points? 

2. The same as Prob. 1 with the point x = ^ removed. 

3. Show that the set of all boundary points (the boundary of a set) of a set S is a 
closed set 

4. Prove that the set of all limit points of a set S is a closed set. The set S consisting 
of S and its limit points is called the closure of S. Is S a closed set? 

6. Prove that the complement of a closed set is an open set, and conversely. 

6. W T hy is a set S which contains only a finite number of elements a closed set? 

7. Prove that the intersection of any number of closed sets is a closed set. 

8. Prove that the union of any number of open sets is an open set. 

9. A union of an infinite number of closed sets is not necessarily closed. Give an 
example which verifies this 

10. An intersection of an infinite numbei of open sets is not necessarily open. Give 
an example which verifies this 

11. Show that (A + ) (A + C) = A + BC. 

12. The set of elements of A which are not in B is repiesented by A B. Show 
that 

A(B - C) = AB - AC 
(A - B) + (B - A) = (A + B) - AB 
A + (B-A}=A+B 

A(B - A) = the null set 

13. Show that, if A + X = R, AX = 0, then X = f(A). The null set (no 
elements) is represented by 0. 

14. If A C S, we call S A C(A) the complement of A relative to S. Show 
that ~~ 

A C B & C(B) C C(A) 
C(T+B) = C(A}- C(B) 
C(AB) = C(A) + C(B) 

10.8. The Weierstrass -Bolzano Theorem. We are now in a position to 
determine a sufficient condition for the existence of a limit point. 

THEOREM 10.16. A limit point p exists for every infinite bounded 
linear set of points, S. 

The proof proceeds as follows: We construct a new set T. Into T we 
place all points which are less than an infinite number of points of S. 
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T is not empty since S is bounded below. Moreover T is bounded above, 
for S is bounded above. From Theorem 10.15 a supremum p exists for 
the set T. We now show that p is a limit point of the set S. Consider 
any neighborhood of p, say, 

p-e<x<p+5 >0 6>0 

Since p e < p, there is a member t of T such that t > p e. Since t is 
less than an infinite number of elements of S (why?), p e is less than 
an infinite number of S. Thus an infinite number of points of S lie to 
the right of the point p e. On the other hand, the point p + 5 has 
only a finite number of points of S greater than it, for otherwise p + 6 
would be a member of T, a contradiction, since p is the supremum of the 
set T. Thus the open interval 

p < x < p + 8 

contains an infinite number of elements of S. Since e, d are arbitrary, 
every open interval containing p contains an infinite number of elements 
of S. This proves the existence of a limit point p. 

Problems 

1. Show that the limit point p of Theorem JO. Hi is the supremum of all limit 
points of S. 

2. Prove the existence of a least or smallest limit point for an infinite bounded linear 
set S. 

3. Prove that a bounded monotonic increasing set of points lias a unique limit. 
Show that the sequence 



0+0' 



n = 1, 2, 3, 



has a unique limit. We call it e. 

4. Let Si and $2 be two closed and bounded linear sets. Consider the totality of 
distances obtained by finding the distance from any point si in S\ to any point 2 in S< 2 . 
Show that there exist two points 5 J; s 2 of Si and <S 2 , respectively, such that |s 2 s } \ 
is the maximum distance between the two sets. 

5. Let S be a closed and bounded set. Define the diameter of the set, and show 
that two points *i, s^ of S exist such that \sz i| is the maximum distance between 
any two points of S. 

6. Consider the infinite dimension linear vector space whose elements are of the 

00 

form a (ai, a 2 , . . , a,,, . . .), the a t real, such that y af converges, written 



00 

V a t 2 



i-i 



If b *= (61, 6 2 , , &n, . . ) with > 6? < B j* oo, then by definition 

t-i 
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a -f b = (01 + 61, a 2 H- 62, . . , a n -H b n , . . .) 

and xa. (xai, xa z , . . . , xa n , . . .). We can define the distance p(a, b) between 
two elements of this space (Hilbert] by the formula 



p(a, b) = [ ^ (& ~ )']* 

n = l 

Show that p(a, b) converges. Hint: Consider the sum 

V (\a n + W 2 

wl 

oo ao oo 

or ( ^ a*) X 2 + 2 (^ a,,/>,,) X + 

n = 1 n*=l 

and prove the Schwarz-Cauchy inequality, 

d -")' 



Also show that 

(1) P (a, b) = t=* a = b 

(2) P (a, b) = P (b, a) 

(3) P (a, b) + P (b, c) ^ P (a, c) 

Show that the infinite hounded set of points 



e, - (i, o, o, . . , o, . . .) 

P2 - (0, 1, 0, , 0, . .) 

g n - (0, 0, 0, . . , 1, 0, . .) 



cannot have a limit point. By a spherical neighborhood of a point 

P - (pi, P2, , p M| ) 

we mean the set of points 

x = (xi, 1-2, . ,*,...) 
which satisfy 



7. Let L be a limit point of a set S. Show that one can pick out a sequence of ele- 
ments from S, say, s\, s z , Sa, . . . , s n , . . . , such that 

Hm s n L 



n > o 



in the sense that every neighborhood of L contains an infinite number of the s t . Fur- 
thermore s n +i is closer to L than s r , . We call such an approach a sequential approach 
toL. 
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10.9. Theorem of Nested Sets. Let Mi, Af 2 , . . . , M n , ... be a 
sequence of closed and bounded sets such that MI D M 2 D M$ D 3 
M n D ' ' ' We show that there is at least one point p which belongs 
to all the sets MI, M%, . . . , M n , .... First we choose an element p\ 
from MI, then p 2 from 71/2, . . . , p n from M n , etc. The set of points 
(pi, p 2 , . . . , p n , . . ) belongs to Mi. This set is bounded since MI is 
bounded. From the Weierstrass-Bolzano theorem the set has at least 
one limit point p which belongs to MI, since MI is closed. Similarly p is 
a limit point of the set (p 2 , PS, . . . , p n , ) These points belong to 
M 2 so that peAT 2 . Finally p is a limit point of the set (p n , p n +i, ) 
so that peM n . Thus p belongs to each 3f, i = 1, 2, . . . , n, . . . . 
If the diameters of the sets M n tend to zero, then p is unique (see Prob. 5, 
Sec. 10.8) for the definition of the diameter of a set. 

Some mathematicians believe it necessary to postulate the existence of 
the set (pi, p 2 , . . . , p n , . ). More generally they desire the follow- 
ing postulate: Let S = {8 a } be any collection of sets. Then there exists 
a set P of elements \p a } such that a point p l otP exists for each set /S t of S, p t 
belonging to the set S % . This is essentially the axiom of choice (Zermelo). 

10.10. The Heine-Borel Theorem. Let $ be any closed and bounded 
set, and let T be any collection of open intervals having the property 
that, if x is an element of S, then there exists an open interval T x of the 
collection T such that x is an element of T x . The Hcine-Borel theorem 
states that there exists a sub collection T f of T which contains a finite 
number of open intervals and such that every element x of S is contained 
in one of the finite collection of open intervals that comprise T' . 

Before proceeding to the proof we point out the following: 

1. Both the set S and the collection of open sets T are given before- 
hand, since it is a simple matter to choose a single open interval which 
completely covers a bounded set 8. 

2. 8 is to be closed, for consider the set 8(\, 1, 7, . . . , 1/X . . .), 
and let T consist of the following open intervals: 



TV x3 


x- \\ 


1 

< 22 




1 


1 


HP * i* ~~~*} 
-/ 2 *- S 


r "2 





T - 

JL n . 



It is easy to see that we cannot reduce the covering of S by eliminating 
any of the given T n , for there is no overlapping of these open intervals. 
Each Tj is required to cover the point l/j of S. 
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Proof of the Theorem. Let S be contained in the interval N^x^N. 
This is possible since S is assumed bounded. Now divide this interval 
into the two intervals (1) N ^ x ^ 0, (2) ^ x ^ N. Any element 
x of S lies in either (1) or (2). If the origin is in S, then lies in both 
(1) and (2). The points of S in (1) form a closed set, as do the points of 
S in (2). Why? Now if the theorem is false, it will not be possible to 
cover the points of S in both (1) and (2) by a finite number of open 
intervals which form a subset of T. Thus the points of S in either (1) 
or (2), or possibly both, require an infinite number of covering sets of T. 
Assume the elements of S in (1) still require an infinite covering; call 
these points the set Si. Do the same for Si by subdividing the interval 
N ^ x ^ into two parts. Continue this process, repeating the 
argument used above. In this way we construct a sequence of sets 

S D S, D 8* D D tin D 

such that each S ^ is a closed set (proof left to reader) and such that 
diameters of the S n * as n > oo From the theorem of nested sets 
(Sec. 10.9) there exists a unique point p which is contained in every /S\, 
i = 1, 2, . . . , and ptS. Since p is in S, an open interval T p exists 
which covers p. This T p has a finite nonzero diameter, so that eventu- 
ally one of the /S,, say, S m , will be contained in T p . Why? But by 
assumption all the elements of S m require an infinite number of the {T} 
to cover them. This is a direct contradiction to the fact that a single 
T p covers them. Hence our original assumption was wrong, and the 
theorem is proved. 

Note that our proof was by contradiction. Certain mathematicians 
from the intuitional school of thought object to this type of proof. They 
wish to have the finite subset of T\ say, TI, 7 7 2, . . . , 7 T r , exhibited, 
such that every point p of S belongs to at least one of the T 3 , j = 1, 
2, . . . ,r. 

Problem. Assuming the Heine-Borel theorem, show that the Weierstrass-Bolzano 
theorem holds. Hint: If S has no limit points, it is a closed set. If p is a point of S 
and is not a limit point, a neighborhood N p of p exists such that N p contains no points 
of S other than p itself. Continue the proof. 

We now show that, if we assume the Heine-Borel theorem true for any 
linear point set, then the existence of the suprernum can be established. 
Let A be an infinite bounded set of real numbers. Why can we dispense 
with the case for which S contains only a finite number of elements? 
Since S is bounded above by an integer N, we can speak of the set T 
consisting of all numbers x such that x ^ every s of S, x ^ N. The 
set T is bounded. Is T closed? Yes, for if tis a limit point of T, and if 
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t < So, SoeS, then we can find a neighborhood of t which excludes so, so that 
the points of T in this neighborhood are less than SQ, a contradiction. Thus 
t ^ SoeS, so that ttT r , and 7 7 is closed. Every member of T satisfies the 
first criterion for the existence of a supremum. Hence, if no member of 
T is the actual supremum of the set S, the second criterion for the exist- 
ence of a supremum must be violated for every member of T. Thus 
every point teT is contained in a neighborhood N t , such that no member 
of S is in this neighborhood. These neighborhoods form a covering of T. 
Assuming the Heine-Borel theorem, we can remove all but a finite num- 
ber of these neighborhoods and still cover the set T. Let the finite 
collection of neighborhoods be designated by 

N: a t < x < b l i = 1, 2, 3, . . . , r 

Let a u be the smallest of the a, i = 1, 2, . . . , r. We assert that a 
belongs to T. Let the reader verify this. Then the neighborhood to 
which a u originally belonged has the property that no point of S is in 
this neighborhood. This is a contradiction since those points to the left 
of a u (<a u ) are not covered by the neighborhoods N. Hence there must 
be a supremum of the set S. Q.E.D. 

10.11. Cardinal Numbers. We conclude the study of point sets with a 
brief discussion of the Cantor theory of countable and uncountable sets. 

DEFINITION 10.19. If two sets A and B are such that a correspondence 
exists between their elements in such a way that for each x in A there 
corresponds a unique y in B, and conversely, we say that the two sets 
are in one-to-one correspondence and that they have the same cardinal 
number. 

Thus, without counting, a savage having 7 goats can make a fair trade 
with one having 7 wives. They need only pair each goat with each wife. 

DEFINITION 10.20. A set in one-to-one correspondence with the set 
of integers (1, 2, 3, . . . , n} is said to have cardinal number n. 

DEFINITION 10.21. A set which can be put into one-to-one correspond- 
ence with the Peano set of positive integers is said to be countable, or 
denumerable, or countably infinite, or denumerably infinite. 

We note that the set of integers I 2 , 2 2 , 3 2 , . . . , n 2 , . . . is, countably 
infinite and at the same time is a subset of all the positive integers. 
The cardinal number of the positive integers is called aleph zero (Xo). 

DEFINITION 10.22. An infinite collection which is not denumerable is 
said to be uncountable, or nondenumerable. 

THEOREM 10.17. A countable collection of countable sets is a count- 
able set. Let the sets be Si, $2, . . . , S n , . . . . This can be arranged 
since we have a countable collection of sets {$}. Since the elements of 
Si are assumed countable, we may arrange them as follows: 



REAL-VARIABLE THEORY 



Similarly 



<S'i: 


011, 01, 013, 


01 


S,: 


021, 022, 028, 


, 02n, 


S t : 


031, 032, 033, 


. , 03n, 


X . 













Now consider the collection of elements 

011, 012, 021, 013, 031, 022, , 01r, 2(-l), , 0r,l, 

This collection is countable since we have a first element, a second 
element, etc. But this sequence exhausts every element of {S}. This 
is done by a diagonalization process. Thus the proof of the theorem is 
demonstrated. 

We can now easily prove that the set of rationals on the interval 
^ x ^ 1 is countable. Into the set S l we place all fractions m/i, 

^ m ^ i i = 1, 2, 3, . . . 

There are a countable number of S each containing a finite number of 
fractions. The complete set of all such fractions yields the set of rationals 
of (0 g x ^ 1), this set being countable. Now let the reader show that 
the set of rationals, {r}, for which oo <r< <*> is a countable set. 

Cantor showed that the set of real numbers in the interval ^ x : 1 
is not a countable set. Assume the set countable. Then the elements of 
^ x ^ 1 can be arranged in a sequence 

S} = O.an 012 013 ' ' ' 01n * ' * 
S2 = O.a2l 022 023 ' ' ' 02n * ' ' 



S n = 0.0 n i n 2 0n3 * * " 0nn * ' ' 

where the a mn are the integers zero through 9. 

The Sj, j = 1, 2, 3, . . . , n, . . . are written in decimal notation. 
Now consider the number s = O.&i 62 b n , where 

bl = 011 2 6 2 = 022 2 ... b n = n n 2 ... 

the -2 occurring if M - = 8 or 9. The number b is certainly not among 
any of the s 3 , j = 1, 2, 3, . . . , n, . . . since it differs from every one of 
these Sj. But the collection {$,} was supposed to exhaust the real 
numbers on the interval ^ x ^ 1. Thus a contradiction occurs, and 
the real numbers are not countable. 
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Problems 

1. Show that the set of algebraic numbers, those numbers which satisfy polynomial 
equations with integral coefficients, is a countable set. 

2. Why are the irrationals uncountable? 

3. Consider the Cantor set obtained as follows: From the interval ^ x ^ 1 delete 
the middle third, leaving; the end points ^, ^. Repeat this process for the remaining 
intervals, always keeping the end points. The set that remains after a countable 
number of such operations is the Cantor set. Since each removed set is open, show 
that the Cantor set is a closed set. If the numbers ^ x ^ 1 are written as decimal 
expansions to the base 3, show that the Cantor set can be put into one-to-one corre- 
spondence with the set of reals on ^ x 5s 1 when these numbers aie written as 
decimal expansions to the base 2. Show that every point of the Cantor set is a limit 
point of the set. 

4. Show that a limit point exists for a nondenumerable collection 8 of linear points. 
The set need not be bounded. Hint' Consider the intervals N ^ x ^ N + 1 for the 
rational integers N. At least one of these intervals must contain a nondenumorable 
number of 8. Why? 

6. Show that a limit point exists and belongs to a nondenumerable collection S of 
linear points. 

10.12. Functions of a Real Variable. Limits and Continuity. Let G 

be any linear set of real numbers. G may be an open or closed set, a finite 
set of points, the continuum, a closed interval, etc. We write G = {x}, 
where x is any element of G. Assume that to each x of G there corre- 
sponds a unique real number y, x > y, and let H {y} be the totality of 
real numbers obtained through the correspondence. This mapping of 
the set G into the set // defines a real-valued function, usually written 

y = f(%) x m G 

We say that y is a function of x in the sense that for each x there exists 
a rule for determining the corresponding y. The notation f(x) simply 
means that / operates on x according to some predetermined rule. For 
example, if y = f(x) = x 2 , 3 ^ x < 5, the rule for determining y given 
any x of the set 3 ^ x < 5 simply is to square the given number x. 
Consider Table 10.1. The set G consists of the numbers x = 1, 2, 5, 14, 

TABLE 10.1 



X 


-i 


2 


5 


14 


y 


3 


8 


-2 






and the set H consists of the corresponding values y = 3, 8, 2, 0, 
respectively. The table defines y = /(x), x in G. The reader is familiar 
with the use of cartesian coordinates for obtaining a visual picture of 

3/ =/(*) 
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An important concept in the study of functions is the limit process. 
One reason why the student of elementary calculus encounters difficulty 
in understanding the theory of limits is simply that the concept of a 
limit rarely occurs in physical reality. The statement lim x 2 = x\ is 



X-+XQ 



quite confusing to beginners. To the novice this statement appears to 
be trivially true, for is not x 2 xl when x = XQ? It must be understood 
that the statement x - Xo is an abstract entity invented by man. We 
visualize a variable x which is orced to approach the constant XQ in such 
a manner that the difference between x and XQ tends to zero. This leads 
us to the following definition : 

DEFINITION 10.23. An infinitesimal is a variable which approaches 
zero as a limit. 77 is said to be an infinitesimal if \tj\ becomes and remains 
less than any preassigned e > 0. 

If 77 i and 772 are infinitesimals, then eventually \TH\ < c/2, |r/2 < e/2, 
for arbitrary e > 0, so that \rii + 772] ^ \rji\ + 772! < e, and 771 + 772 is also 
an infinitesimal. Let the reader show that the difference and product 
of two infinitesimals are again infinitesimals and that the product of a 
bounded function with an infinitesimal is again an infinitesimal. Let x 
be a variable which m succession takes on the values 1, 3, ,..., 
1/n, . . . . Certainly x is an infinitesimal since l/n > as n becomes 
infinite. For any value of x the sum S x -\- x -\- x -{- ' ' 'is infinite. 
On the other hand each term of $ tends to zero. Hence an infinite sum 
of infinitesimals need not be an infinitesimal. For another example, let x 
be an infinitesimal, so that # 3 /(l + x 2 ) n is also an infinitesimal for n = 0, 
1, 2, .... Now 



so that S is also an infinitesimal. An infinite sum of infinitesimals may 

or may not be an infinitesimal. Let the reader show by specific examples 

that the quotient of two infinitesimals may or may not be an infinitesimal. 

Let us return to the statement lim x 2 = #. To show that this 

X >0 

statement holds, we must prove that x 2 x$ is an infinitesimal whenever 
x #o is an infinitesimal. Now \x 2 x\\ = \x XQ\ \x + #o|, and 
| a: + XQ\ is bounded for x near XQ. Since \x x \ is an infinitesimal (by 
our choice), of necessity x 2 x\ is an infinitesimal, so that x 1 x\ > 0, 
or x 2 * .TO, whenever x > XQ. 

Let the reader prove the following results: If lim /(x) = M, 

x*a 

lim g(x) = N 
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then 

(1) lim [/(x) g(x)} = M N 

(2) Ito [/(x)0(x)] = MAT 

M 



N 



N 5*0 



A knowledge of limits enables us to introduce the concept of continuity 
of a function. We consider the functional relationship y = /(x), and 
we let x = x be a point of G with y Q = /(x ). What shall we mean when 
we say that f(x) is continuous at x = x ? Intuitively we feel that, if x is 
any point of G which is close to x , then /(x) will be continuous at x = x 
if the corresponding value y will be close to y$. Actually we desire 



y 



y =/(*)/ 




FIG. 10.2 



y yo to be an infinitesimal if 
x XQ is an infinitesimal. 

DEFINITION 10.24. /(x) is said 
to be continuous at x = x if for 
any e > there exists a 5(e) > 
such that 

|/(X) -/(So) | <6 

whenever |x x | < 6, x in G. 

Let us look at the graph of Pig. 
10.2. For any c > we construct 
the horizontal lines y = y + c, 
y = y Q . If we can draw two dis- 
tinct vertical lines x = XQ + 5, x = XQ ~ 6 such that, for every point x of G 
in the open interval XQ 6 < x < XQ + 5, /(x) has a numerical value lying 
between /(x ) c, /(x ) + , we say that /(x) is continuous at x = XQ. 
The distinct vertical lines must exist for every e > 0. Note that the 
maximum size of 6 will depend on c. The smaller e is, the smaller 6 will 
have to be. Moreover the p"oint x will influence the value of 6. The 
steeper the curve, the smaller 5 becomes. 

Two equivalent definitions of continuity are as follows: /(x) is con- 
tinuous at x = x if: 

(a) lim /(x) = /(x ), x in G. 

X-+XO 

(b) For any c > there exists a 6 neighborhood of x such that |/(xi) 
f(xz) | < e whenever Xi and X2 are in G and belong to the 6 neighborhood 
of XQ. 

Functions are discontinuous for two reasons: 

1. lim /(x) does not exist. 

x *xo 

2. lim /(x) exists but is not equal to /(x ). 

X 
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Example 10.1. Consider /(x) - x + 1, x * 2,/(2) - 5. We have 

hm (x + 1) - 3 

x-+2 

However /(2) = 5 ^ 3, so that /(a;) is discontinuous at x = 2. If we were to redefine 
the value of /(x) at x = 2 to be 3 instead of 5, we would remove the discontinuity. In 
this example f(x) has a removable type of discontinuity. 

Ol/x 

Example 10.2. Consider /(x) > x 5* 0, /(O) - 1. We have 

2 1 /* 2 1/z 

lim 9 = 1 hrn 7-37^. = 

x->0 1 "T 2 1/x _>() 1 -T 2 1/T 

z>0 j<0 

Hence lim /(x) does not exist for all manner of approach of x to zero It is obvious 

x-0 

that f(x) does not have a removable type of discontinuity at x ~ 0. 

Example 10.3. f(x) = sin (l/x), x ^ 0, /(O) = 2. Since /(x) oscillates between 
1 and -h 1 as x > 0, lim /(x) does not exist and f(x) is discontinuous at x = 0. 

x-*0 

Example 10.4. /(x) = l/x, x 5^ 0, /(O) = 41 Some authors consider that \/x 
is discontinuous at x = since division by zero is undefined. In this example 

f(x) is discontinuous at x = since hrn l/x does not exist. On the other hand 

z->0 

/(x) (x 2 l)/(x 1), x 7^ 0, /(O) 2 is continuous at x = 1 even though 
(x 2 l)/(x 1) is undefined at x = 1. 

Example 10.5. If /(x) = x 2 for ^ x ^ 1, we note that /(x) is continuous every- 
where on this interval. Why is /(x) continuous at x = and x = l? If a; =* c is a 
point of this interval, we have x 2 r' 2 | = \x c\ |x -f c\ < 3|x c| so that 
|x 2 c 2 | < * whenever |x r| < e/3 = 6. We have found a value of 5 independent 
of the point x = c. We say, therefore, that /(x) = x 2 is uniformly continuous on the 
interval ^ x ^ 1. 

Example 10.6. Let/(x) = l/x for a ^ x ^ 1, a > 0. It is easily seen that/(x) is 
continuous for every x of this range. For x c we have 



1 __ 1 

x c 



x c 



Thus 



1 



1 



J if -j |x c| < e or |x c| < 2 6 = 5. Since 5 = a 2 e is independ- 
ent of x = c, we have uniform continuity. On the other hand /(x) is also continuous 
at every point of the interval < x ^ 1 . But /(x) is not uniformly continuous on this 

range since 6 > as a > 0. For any e > we cannot find a 5 > such that 

x c 

< whenever \x c| < d for all c on the range < x ^ 1. Note that the interval 
< x ^ 1 is not a closed set. 

DEFINITION 10.25. Let f(x) be defined over the range G. f(x) is 
said to be uniformly continuous on G if for every > there exists a 
5 = 5(c) such that 

l/(*i) -/(a*)! < c 

whenever \xi #2! < 5, x\ and #2 in G. 

THEOREM 10.18. If G is a closed and bounded set and if f(x) is con- 
tinuous at every point of G, then /(#) is uniformly continuous on G. 
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Proof. We base the proof of this theorem on the Weierstrass-Bolzano 
theorem. First let us assume that f(x) is not uniformly continuous on 
the closed and bounded set G. This means that there exists at least one 
= such that for any 5 > there will exist Xi, Xi such that |/(xO 
/(xi)| > eo with |xi Xi\ < 6. We take 5 = 1 and obtain the pair of 
numbers x\, x\. Then we take 6 = -J- and obtain the pair (x 2 , x 2 ). For 
5 = 1/ri we have the pair (x n , x n ) such that \f(x n ) /(x n )| > e with 
\x n x n | < l/n = 5. By this process we generate the two sequences 

(Xi, X 2 , . . . , X n , . . .), ( 5 1, X 2 , . . , X n , ) ShlCC G IB & dosed 

and bounded set, the sequence (xi, x 2 , . . . , x n , . . .) has a limit point 
x = c belonging to G. We pick out a subsequence which converges 
sequentially to x = c. The corresponding points from the sequence [x n \ 
have the same limit, x = c,since|o;n x w | < l/n. Let these two sequences 
be designated by {x' n \, \x' n }. Since /(x) is continuous at x = c, we have 
I/GO - /Ml < o/2, |/($ n ) - /(c)| < eo/2, for n sufficiently large. Thus 
I/OO -~/(Zn)l < c o, a contradiction, since for all pairs x^, x' M |/(x n ) 
/(* n )| > 6 . Q.E.D. 

THEOREM 10.19. If /(x) and 0(x) are continuous at x = a, then 
/(x) g(x), /(x)0(x), f(x)/g(x) with 0(a) ^ 0, are continuous at x = a. 
The proof is left to the reader. 

THEOREM 10.20. If /(x) is continuous over a closed and bounded set 
G, then /(x) is bounded. 

Proof. Assume /(x) is not bounded. There exists an x\ such that 
/(xi) > 1, an x 2 such that /(x 2 ) > 2, . . . , an x n such that /(x n ) > n, 

. . . The set (xi, x 2 , . . . , x, . . .) has a limit point x belonging 
to G since (? is a closed and bounded set. There exists a subsequence 
(xj, Xg, . . . , x^, . . .) which converges sequentially to x. Thus from 
continuity 

lim /(x;) = /(x) 

Xn' Z 

But /(x n ) > co as x^ > x, a contradiction, since /(x) has a specific real 
value at x = x. 

THEOREM 10.21. If /(x) is continuous on a closed and bounded set 
G, then a point x of G exists such that /(x ) S /(x) for all x of G. 

Proof. From Theorem 10.20/(x) is bounded above as x ranges over G. 
Thus the set of values {/(x) } has a supremum, say, S. Thus S ^ /(x) 
for all x of G. Choose Xi such that /(xi) > S 1, x 2 such that /(x 2 ) > 
S I, . . . , x n such that /(x n ) > S l/n, .... Why is this possi- 
ble? The set (xi, x 2 , . . . , x n , . . .) has a limit x belonging to G. We 
pick out a subsequence {x n } which converges sequentially to x, with 
x^ = x m . Now/(x m ) > /S 1/m, so that 



lim f(xm) ^ lim ( S - - } = S 



(*-!)- 

V / 
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From continuity, lim f(x^) = lim f(x m ) = /(x), so that /(a) ^ S. But 

n > oo m * oo 

S ^ /(z), so that/(s) = 5. Q.E.D. 

THEOREM 10.22. Let f(x) be continuous on the range a ^ x ^ 6. If 
/(a) < 0,/(6) > 0, there exists a point c such that /(c) = with a < c < b. 

Proof. Let T be the set of points of a ^ x ^ 6 such that /Or) < 0, and 
if f(xi) < 0, x z < Xi, then /(z 2 ) < 0. We speak only of x on the range 
a ^ x g Z>. T is not empty since x = a belongs to T. T is bounded 
above since x = b does not belong to T. Hence T has a supremum; 
call itx = c. Now/(c) = 0, or/(c) > 0, or/(c) < 0. Assume /(c) > 0. 
Since f(x) is continuous at x = c, we can find a 5 such that 



whenever |jc - c\ < 5, 5 > 0. Thus/(z) > /(c)/2 > whenever |x - c| 
< 5. But /(a;) < for x < c, a contradiction, so that/(c) g Let the 
reader show that /(c) < cannot occur, so that /(c) = 0. 

Problems 

1. For the following functions find a 6 = 5(e) such that \f(xi) f(x 2 )} < c whenever 
\x\ x z \ < 5: 

(a) /(JT) = 2x* for -1 g JT ^ 1 

(b) /( X ) = sin x for ^ j- ^ 27r 
(r) f(x) = -v^l"^ a: 2 for - 1 g a; ^ 1 

2. Show that g(x) = x sin (I/a*), x 5^ 0, /(O) = 0, is continuous at x = 0. 

3. Show that y = sin (1/jr) is not uniformly continuous on the range < x ^ 1. 

4. Find an example of a function /(x) which is continuous on a bounded set, but /(a) 
is not bounded. 

6. Find an example of a function f(x) which is continuous on a closed set, but f(x) 
is not bounded. 

6. Prove Theorem 10.19. 

7. Discuss the continuity of f(x) = when x is rational, f(x) = x when x is irra- 
tional, /(O) = 0. 

8. Let f(x) when x is irrational, f(x) l/q when x = p/q, (p, (/)=!. Show 
that f(x) is discontinuous at the rational points and is continuous at the irrational 
points. Hint: Show that there are only a finite number of rational numbers p/q with 
(PJ #) =* 1 such that l/q > e for any > 0. 

9. Prove Theorem 10.18 by making use of the Heine-Borcl theorem. 

10. Prove Theorem 10.20 by making use of Theorem 10.18. 

11. Let f(x) g(x] for x rational. If f(x) and g(x) are continuous for all x, show 
that /(x) = g(x) for all x. 

12. Let f(x) and g(x) be continuous on the range a ^ x ^ 6. Let h(x) = f(x) if 
f(x) ^ g(x), h(x) = g(x) if g(x) ^ f(x). Show that 

feW = iCfW + flf(x)) + i|/(or) - 0(x)| 

Also show that h(x) is continuous for a ^ x ^ b. 

13. If /(x) is defined only for a finite set of values of x, show that f(x) is continuous 
at these points. 
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14. If f(x) is continuous at x = c with/(c) > 0, show that a 5 > exists such that, 
for \x - c\ < 5, f(x) > 0, x in G. 

15. If f(x) is continuous on a closed and bounded set G, show that a point X Q of G 
exists such that f(xo) ^ f(x) for all x in G. 

10.13. The Derivative. Let/(x) he defined on a range G, and let c be 
a limit point of G. We consider any sequence of G which converges to c, 
say, {x n }. Next we compute 



/(c) 
c 



(10.9) 



If this limit exists and is independent of the particular sequence which 
converges to c, we say that /(x) is differentiable at x ~ c. We represent 
this limit or first derivative by /'(c). 

The definition above is equivalent to the statement that for any e > 

there exists a 6 > and a number, 

f'(c), such that 



y 



o 




- f(c) 



x c 



< 



whenever 0< \x c\ < 6, x in G. 

It follows almost immediately that 
it /'(c) exists then/(x) is continuous at 
.1 = c, for 



FKJ 10.3 



\f(x) ~/ 



- c) 







as JT > c implies /(a;) > /(c) as a: > c. 

The reader is well aware of the geometric interpretation of the first 
derivative (see Fig. 10.3). In Fig. 10.3, 



PR = A* 



= f(x + 



- f(x) 



the slope of the secant line L joining P to Q is" 



-/(a?) 



the slope 



of the tangent line T is defined as /' (x), and RS = f(x) Ax is called the 
differential of /(x). 



Example 10.7. Let /(x) = x 2 sin (1/x), x 
differentiable at x =0, we compute 



0, /(O) 0. To see whether f(x) is 



lim 



. 

X 



lim 



_ = hm x sin = 

x->Q X 



Hence /'(O) =* 0. Let the reader show that/' (x) is not continuous at x = by showing 
that lim f'(x) does not exist. 
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The reader can verify that, if f(x) and g(x) are differentiable at x = c, then 

j- x [/(*) 0(*)] - f(c) + g'(c) 

lr-e = f(c)g'(c) + 0(f)/'(c) (10.10) 



The following; differentiation rules are known to hold: 



d (constant) n d cos x 

- ^ - =0 . 

dx dx 

dx n 



, >^ .- = e x 

dx dx 

d sin x d In x 1 

- y = cos x j = - 

dx dx x 

Example 10.8. Implicit Differentiation. If y /(M) and u = <p(x), we say that y 
depends implicitly on x. Let MO = <P(XO), and assume that -^ exists at u = MO, -? 
exists at x = xo. We show that 

dy _ df d<p _ dy du 
dx du dx dy dx 

at x = Jo, u no. From 

A?/ /(w -f AM) -/(MO) 

we have 

A// _ /(MO -f A?^) /(MQ) y?(x -f Ax) ^(x ) A// p^ ) 

_ _ __ t (lu.ll ) 

Ax AM Ax Ax ^ J 

If, as Ax 0, AM does not vanish infinitely often, then eventually AM 7^ 0, and 

dti , A?/ < 
-7- = lim ~ = - 



since the limit of a product is the product of the respective limits, provided they exist. 
On the other hand, we cannot apply (10.11) if AM == infinitely often as Ax 0. For 

this case, however, -j- = hm = 0. But for those values of Ax for which AM = 
dx A _ Ax 



we have *H . fM- . o, so that lim - 0. Hence - still holds 

Ax Ax AZ-+O Ax dx dy dx 

since both sides of this equation vanish. 

Example 10.9. The Rule of Leibnitz. The derivative of a product u(x)v(x) is 

d , . dv . du 

-j- (uv) = u -j h v -j- 
dx dx dx 

If we differentiate again, we obtain -r-r (MV) = u -r- 9 4- 2 3^ ~ + v -~ Let the 

ox 38 ax z ax ax ax z 
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reader prove by mathematical induction or otherwise that 

dn ( \ _ d " v _L ( n \ du dn ~ lv _L f n \ d* u d "~ 2v -i. 
5? (UV) " W d^ + Vl/ Sd?= + W dx* d*- 2 + ' ' * 



with -77 = w, -TT = f. As an example of Leibnitz's rule we consider the Legendie 
dx l} dx (} 

1 d n (x 2 _ l) n 
polynomials P n (x), denned as P n (x) - ^-. . >P (x) = 1, n = 0, 1, 2, . . . . 

i Til (IX 

We have 



/ \ i n 1 
-W + 2*-in 



. 

(n - 



dx dx 

- x ^r*M 4. n P n -i(Jc) (1013) 

Moreover 

r^ , x 1 d H f/ 



._ ( r 2 __ i \ ( (^ T 2 IV*" 1 4- 2r>r (r 2 IV'" 1 

2"n! L dx n l ; ^ dx"- 1 l j 



w + .g +fJ .. , w ,,, 

( 1 oinbining (10.13) with (10.14) yields 



(10.15) 



^ [(1 - x) ^^] + W(TI 



Equation (10.15) is Legendre's differential equation (see Sec. 5.13). 



From Theorem 10.21 we readily prove Theorem 10.23 due to Rolle. 

THEOREM 10.23. Rolle's Theorem. Let f(x) be continuous in the 
interval a g x ^ 6, and let f(x) possess a derivative at every point of 
this interval. If /(a) = f(b) = 0, there exists a point x = c, a < c < 6, 
such that /'(c) = 0. 
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Proof. From Theorem 10.21 there exists at least one point x = c such 
that f(c) ^ f(x) for a ^ x b. Now 

Ac) -lim # + *>--&-> 5=0 

fc-o ft 



since /(c + /O = /(c) . Moreover 



Of necessity, /'(c) = 0. Q.E.D. The condition of the theorem can 
be weakened by assuming that /'(c) exist at x c, with /(c) a local 
supremum or inf emum of f(x) . Let the reader give a geometric; interpre- 
tation of Rollers theorem, and let him construct an example of a continu- 
ous function, /(#), /(a) = /(&) = 0, such that nowhere is f(x) = 0, 
a ^ x ^ b. 

Law of the Mean. Let f(x) and <p(x) be differentiable functions on 
the interval a ^ x ^ b. We show that a point x c exists such that 

( v (b) - *(a)]f (c) = [/(&) - f(a)] v '(c) (10.16) 

The function 



- /(a)] - /(*)[?(&) - *(a)] - v (a)f(b) + <t(b)f(a) 

vanishes for x a and x = b. $(x) was obtained by finding A, B, C 
such that $(x) = A<p(x) + J5/(x) + C vanishes for x = a, x = 6. Apply- 
ing Rollers theorem to \f/(x) yields the theorem of the mean. As a special 
case, choose <p(x) = x, so that (10.16) becomes 

/(&) - /() = / x (c)(6 ~ a) a <c <b (10.17) 

Let the reader give a geometric interpretation of (10.17). 
L' Hospital' s Rule. From (10.16) we have 

/(b) - /(a) _ /'(c) 



- <p(a) 



a < c < b (10.18) 



provided <p(b) <p(a) -^ 0, ^>'(c) ?^ 0. Let us assume that /(a) = 0, 
<p(a) = 0, so that 



and lim = lim = lim 

;>_ (p'(c) f _ ^ '(C) 



since a < c < 6. We can rewrite (10.19) as 

lim ^f( = lim f& (10.20) 

' 
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which is L'HospitaFs rule. We cannot apply (10.20) unless /(a) = 0, 
*(a) = 0. What can be done if /' (a) = 0, /(a) = 0? If /' (x) and <p'(x) 
are continuous at x = a, and if <p'(a) 9* 0, then (10.20) can be written 



(10.21) 



provided /(a) = 0, <p(a) = 0. 
Example 10.10 



sm x - x + z 3 /6 , cos a* - 1 + 

lim -- r" = lim --- 



-0 



5.r 4 



Now g(x) = cos r 1 -h # 2 /2 and h(x) = 5x 4 both vanish at x =0 and are difforonti- 
able. Reapplymg (10.20) yields 

.. sin x x + x 3 /Q . -sin x -f x ,. cos x + 1 
hm - 7 - * Inn - - = lim - -~ --- 
x *->o 



, . sm x , . cos x 1 
hm --r- = hm ^r- = J^T 
J^H-O 120 120 



We complete this section with a discussion of the second derivative. 
Let f(x) be differentiate in a neighborhood of a: = c. The second deriva- 
tive of f(x) at x = c is defined as 



provided this limit exists. Now f'(.r) = lim*^ ----- ^ - ^-- so that 



f'(c) = lim^ /'(c + *) = Um 

/.-.O ft ' A->0 



and f"(c) = lim 



nm i im /(c + k + h) - f(c + k) - f(c + ft) + /(c) 



The order in which we take these limits is important. It can be shown, 
however, that under certain conditions we can replace k by h so that only 
one limit process is necessary. W T e would thus have 

/" (c) - lim f(c + 2k)-2f(c + h) + f(c) (10>22) 



We now investigate under what conditions (10.22) yields /"(c). Let 
U(c) = f(c + A) - /(c), U(c + K) = f(c + 2h) - f(c + h),so that 

U(c + A) - C7(c) = /(c + 2A) - 2/(c + A) + /(c) 
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Applying the law of the mean as given by (10.17) yields 

hU'(c) = h[f'(C + h) - /'(c)] = /(c + 2/0 - 2/(c + h) + /(c) 
Applying (10.17) once more yields 

hT(c) = f(c + 2ft) - 2/(c + ft) + /(c) (10.23) 

with c < 5 < c + h. To obtain (10.23), we assumed that f"(x) exists 
in a neighborhood of x = c. If, moreover, f"(x) is continuous at a: = c, 
we note that (10.22) results. 

Problems 

1. Derive the results of (10.10) 

2. Show that if y x m/n , m and n integers, then -^ = x m/n ~ l by assuming 

(IX fl 
_____ wa .n 

for any positive integer n. 

cii ^ x i- COS 1 + X 2 /2 1 

3. Show that hm 1 = r^- 

4. Let w(z) = tan^ 1 x. From (1 + z 2 ) -7-^ -f 2x ~ find -^ at z for 

dx dx dx n 

n = 2, 3, 4, . , by the use of Leibnitz's rule 

5. Show that -j (e ax f(x)) e aT (D -f a) (n) f(x) with D ~ ^-> /) 2 = -r-v t 



fc-o 

6. Find an expression for D n [f(x} cos \x]. 

7. Prove Leibnitz's rule by mathematical induction. 

8. If f(x) Qfor a x b, show that/(x) = constant for a ^ x ^ b. 

9. Show that (10.17) can be written as /(6) -/(a) = (6 - a)/'(a + 0(6 - a)), 
< e < 1. 

10.14. Functions of Two or More Variables. We say that /(x, y) is 
continuous at x = #o, y = 2/o if for any > there exists a 5 > such 
that 

l/(z, y) -/(zo, 2/0) I < c 

whenever |x z | 2 + |y 2/o| 2 < 5. In general f(xi, #2, . . , # w ) i g 
continuous at (#?, x^, . . . , x^) if for any e > there exists a 6 > 
such that 

t.. /*. \ /f/yO 'rO /.(r\ I s* , 

If J/2, j J/nJ J \<i) Jsty | Jsft) | V. 



n 

whenever Y \Xi x$\ 2 < 8. 



Theorems 10.20 and 10.21 can be shown to be true for a continuous 
function of n variables, n finite. 
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An example of a function which is continuous in x and is continuous in 
y but is not continuous in both x and y is as follows : Define 



xy 



x 2 + y 2 7* 



/(O, 0) = 



Then lim f(x, 0) = lim = 0= /(O, 0), so that f(x, y) is continuous in x 

r->0 r->0 

at (0, 0). The same statement applies to the variable y. However, 



since lim f(x, y) does not exist. This can be seen by letting y = mx, 

x->0 
V-0 

fTLX^ Vfli 

m 7 0, and noting that lim /(x, y) = lim -^ , , 9 = t : - ^ Since 

*-() *-+o * 2 + ra 2 x 2 

i/-o 

m is arbitrary, lim f(x, y) does not exist. 



m 



One defines the partial derivatives of /(x, y) at the point (x , 2/0) as 
follows : 

==: lim 

o Ax /AX 



/iOo, Z/o) = 






A?/) - 



(10.24) 



A// 



provided these limits exist. Further derivatives can be computed, and 
we write 

d 2 / , _ d 2 / , d 2 / 



/u = 

UJU 

Example 10.11. Consider 



: dy 

x 3 y 
*+l 



f = 

/22 2 



V 1 * 



/(O, 0) = 



Then 



/i(0, 0) - ^ 

/i(o, y) - 

/ 

/2l(0, 0) = 



/>-><) 



0,0 



^ L MLIL^^T lim o^ 

/_*n " Ji_kft "- 



A(o,o)-| 



0,0 



o, o) ,. o A 

= lim T = 
A-^O' 1 



0, 0) - r-~V- 

' y dxdy 



0,0 



lim r = 

fc_*n n 
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In this example, 



dy dx 



a 2 / 



0,0 dx dy 



are continuous, then /i- 2 == / 2 i. 
Proof. Define 



0,0 



. We can show, however, that, if fi% and /2i, 



U = f(x + Ax, y + A?/) - f(x, y + Ay) - /(x + Ax, y) + /(z, y) 

/(* + Ax, y) - /(x, y) 
*(y -f Ay) = /(a, -f Ax, y -f A?/) /(x, y 4- Ay) 
so that (7 = *(y + Ay) - *(y) 

If we apply the law of the mean to U, we have 

p. 



Again applying the law of the mean to the variable x (y -f 0i Ay can be considered as a 
constant as far as x is concerned) yields 

U = A Arr d 2 /(* + ^ Ax, y + B, Ay) < ^ < ] 
J dx dy < 6 2 < I 

By interchanging the role of x and y one easily shows that 

+ 6 * AX > ^^ + 04 A '^) < 3 < 1 



A. A 

- Ax Ay 







Equating these two values of U, dividing by Ay Ax, and finally allowing Ax -> 0, 
Ay 0, we obtain 



dy dx dx dy 

provided these partial derivatives are continuous. 

Example 10.12. Total Differentials. Let z = /(x, y) have continuous first partial 
derivatives. Now 

z 4- Az = /(x -f Ax, y + Ay) 

Az = /(x + Ax, y + Ay) - /(x, y) 

= [/(x + Ax, y + Ay) - /(x, y + Ay)j + [/(x, y -f Ay) - f(x, y)} 

If we apply the law of the mean to both terms above, we obtain 

Az = M* + * A *> y + A ?/) Ax . */(* y + * *y) Av 

^x Oy 

with 0<^i<l,0<02<l. Under the assumption of continuity of the first partial 
derivatives we have 

df(x + 0i Ax, y + Ay) _ df(x, y) 
_ _____ 



_ __ 

2/ -r- ^2 Ay) _ a/(x, y) 



2 



where ei > 0, c 2 > as Ax > 0, Ay > 0. Hence Az becomes 

A, = ^M Ax + 3/M AJ , + 1 Ax + 2 Av (10 . 26 ) 
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The principal part of A is defined to be 



Af i/ 

If z f(x, y) 3 x, dz = do;. ~ = 1 , TT - 0, and Ax = dz. Similarly Ay = dy, so that 

ox ay 

d z = fdx + fdy = jZdx + %- dy (10.26) 

dx dy dx dy 

where dz is called tho total differential of z If x and y arc functions of /, then from 
(10.25) 

^ = <VC*V'/) A , <V(,j[) AV , ^ , ^/ 

A/ " d.r " A^ a// A* "^ l A< * 2 A/ 



If ~ and exist, we obtain 



cte _ df dx df dy 
' 



since i > 0, e 2 * as A/ - -> 0. Remember that if A > of necessity Ax * 0, 
At/-+0. 

In the general case, if u = /(x 1 , x 2 , . . . , z n ), the total differential of u is defined as 



The operator form of (10 20) is 

d[ \= dx-~\ }+dy-jt[ } (10.29) 

where the bracket can represent any differentiate function of x and y. In particular, 



and ^ = rf((fe) = rfa: + 2 rfx dy -f d?/ (10,30) 

provided - { T~~{~~' Symbolically, we can write (10 30) as 
ox dy dy dx 

Jt [dz , , d* 

^-LaS^ + ai 

In general 
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Example 10.13. Change of Coordinates. Let z = f(x, y), and consider the change 
of coordinates given by x x(u, v), y y(u, v), so that z depends implicitly on u and 
v. From (10.25) 

^ d/(*, //) A , a/(j, y) Ay Ax A // 

"" "^ Cl 



Aw dx Aw dy Aw Aw Aw 

Allowing Aw to approach zero yields 



du dx du dy dy 
Similarly, 

dz = df_ dx _^ df_ ay 
dv dx dv 

Symbolically, 



where the brackets can represent any function of x and y with x and // depending on 
any variable r Of course ~ and ~~ must, in general, be continuous. In particular 

*\~. \ '\~ I ~~ a-. fk~.l \ M i "i 



dz\ _ dx d_ (dz\ , #// f ^ /^A 
^i/ ~ du dx \dx) du dy\dx) 
_ d*z_ dx d*z dy 
~ 'dx 2 du dy dx du 

d /dz\ = dx d /dz\ ,d]/d_ (dz\ 
u \dy) ~ du dx \dy) du dy \dy) 



dx . d 2 z dy 

-f 



dx dy du dy* dn 
Example 10.14. Consider z = z(x, y}, x = r cos 0, y = r sin 6 Then 

dz dz dx . dz dt/ dz , dz 

_j_^ = ( . os -. 

dr dx dr dy dr dx 



d fdZ\ . Q d (dZ\ 

dr (dx) + Sm dr (dy) 



d 



osu ln .., _ , dydxdr ^ ~" \_ dxdydr ' d ya r j 



= COS 



Let the reader show that 

dz n dz . n dz 

^. r sm f_ r cog __ 

dO dx dy 

d*z dz dz 

=s r cos r cos 

dO* dx dy 



,,~ *, r d*z . , a 2 c ~i 

_ = - r cos - r cos - / sm 0\ - r sm + 7- r cos 



[a 2 2 f) 2 Z 1 

- r sin -f 7 ; ?* cos I 
ax ay ^ ay 2 J 
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Problems 

1. Let z = x 2 y 2 , x = e', y sin t. Find -j- by two methods 

A T i o o . r , j dx dy dx , dy Tr J 

2. Let u = x z ?/ 2 , v = x -f T/. find > --i > and * Hint 

du du dv dv 

du , n dx rt O?/ 

7 - = i = 2x 2?/ - - > 

du du du 

= = 2x ^ - 2y ^ etc. Why is it true that ?- = 0? 
oZJ a# ay dt> 

3. Let x r sin cos <f>, y = r sin sin p, 2 r cos 0. If V = V(x, y, 2), show 
that 

,*!r *!i + *!r 

a?/ 2 ^ d2 2 

' dV d d 



4. Let /(xi, X 2 , . . . , x n ) be homogeneous of degree k, that is, 
Differentiate with respect to /, set t = 1, and prove a result due to Euler, 

*/ 



6. Prove (10.31) by mathematical induction. 

6. If f(xi, x 2 , . . . , x n ) is homogeneous of degree k (see Prob. 4), show by mathe- 
matical induction that 

k(k - 1) - (k - p + I) 

10.15. Implicit -function Theorems 

THEOREM 10.24. Let x = x , y = yo satisfy F(x, y) = 0, and let us 

r\ 77T *\ T7I 

suppose that F. ? and are continuous in a neighborhood of x = x , 
*^ ' dx dy & ' 

y = ?/o, say, for \x XQ\ < I, \y y<>\ < 1. If ^ at x = XQ, y = y<>, 

there exists one and only one continuous function of the independent 
variable x, say, y = f(x), such that F(x, /(#)) = with ?/o = /(ZQ). In 
other words, if Xi is any number near XQ, there exists a unique ?/i such that 
/(#!, 2/i) = 0. It is in this sense that we speak of y as a function of x. 

dF 
Proof. Assume > at P(x Q , y Q ). The same reasoning applies to 

the case < 0. From continuity of ' there exists a neighbor- 

SF 
hood of P(x*> 2/0) such that -^ > Q > 0, Q fixed. We can decrease the 
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size of this neighborhood, if necessary, so that it is bounded by \x XQ\ < 
l> \y ~~ 2/o| < 1. In this new neighborhood of x = XQ, y = 2/0, given by 

f$J? AJ? A// 

| a; XQ\ < IQ, \y yo\ < /o, we have F, > continuous and ^ > 

ox oy dy 



Q > 0. Let R > be any upper limit of 



dF 
dx 



in this neighborhood. 



For any x and 2/ satisfying \x XQ\ < IQ, \y 2/o < IQ we have 

F(x, y} = F(x, y) - F(x , 2/0) = F(z, ?/) - F(x*, y) + F(x, y) - F(x, */) 



Let y = y + t, < e < 1 . Then 

F<V a. i - IT r ^ dF( > 3 ' + e ^' 

/- (X, y 0+t )-( X - xo) 







Since eQ > 0, we shall have F(x, yo + e) > if \x - x \ < eQ/R. Simi- 
larly F(x, 2/0 - e) < for |o: - a- < eQ//?. From Theorem 10.22 a ?y 
exists such that F(x, y) = 0. We have shown that, for any x satisfying 
\x XQ\ < eQ/R ^ IQ, a number y exists such that F(x, y) 0. We say 
that y is a function of x and write y = /(x), so that F(x t /(#)) = 0. 

Next we show that y is a single-valued function of x. Assume that, 
for any number x\ satisfying | x xo < eQ/R ^ IQ, there exist two num- 
bers ?/i and 2/2 such that /'X^i, 2/i) = and F(XI, 7/ 2 ) = 0. Applying the 
law of the mean to F(x } , y-z) F(XI, y\) = yields 



Since > 0, of necessity 1/2 = 2/i> s that y = /(x) is single-valued. To 

show that y is a continuous function of x, we note that if F(x\, yi) 0, 
F(x z , 2/2) = 0, then 

F(x 2> 2/2) - F(X!, 2/2) + F(x l} 2/2) - F^!, 2/1) = 
-xi), 2/2) , . .dF(x lyyi + e 2 (y 2 - y,)) 

- __ 



2 _ l _ . i __ _ 

flW rlJ? 

Since ^ and is bounded, it is obvious that 2/2 * y\ as # 2 > x\. 
oy ox 

This is exactly the property of continuity. 

Theorem 10.24 is a special case of Theorem 10.25. 

THEOREM 10.25. Let F(XI, z 2 , . . . , x n , z) satisfy the following 
conditions : 

1. F is continuous at the point P(a,i, a 2 , . . . , a n , ZQ). 

2. The first partial derivatives of F exist at P and are continuous. 
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3. F(di, a 2 , . . . , a n , 2 ) = 0. 

dF 

4. 2L * at P. 
dz 

By the method used in Theorem 10.24 it can be shown that a neighbor- 
hood of Po(ai, a 2 , . . . , a n ) exists such that z = J(x\, x z , . . . , x n ) and 

F(Xi,Xt, . . . ,X n ,f(Xi,Xz 9 . . . ,X n )) = With ZQ = /(i, 2 , . . . , O n ) 

Moreover 2 is a continuous function in the variables xi, #2, . . . , x n > 
For the case F(x, /y, 2) = 0, z = /Or, //), F(a, fe, c) = we have 

f (a?, ?y, 2) - /<>, 6, c) = if z = /(.r, /y) 
so that 

F(x, y, z} - F(a, y, z) + F(a, y, z) - F(a, b, z) + F(a, 6, z) ~ F(a, 6, c) = 
Applying the law of the mean to each difference yields 

^ ^ dF ( + e ^ x ~ a )> y> z ^ 4. ^ _ M ^(^ b + ^(?y - ft), g) 
^ - a) . -h / oj 



1 v ' dz 

If y = 6, we have 

lim A^ = ^ = lim 7~Z~^ = ~~ air /a* 

(10.35) 

Similarly 



fy 

Example 10.15. Consider F(x, y, z] xe* + yz y. At the point (1, 1, 0) we 
have F(l, 1, 0) - 0, and ~- = xe* + y 7* at (I, 1, 0). Thus F(x, y, z) - defines 
z as a function of x and y, z ** f(x, y), for a neighborhood of (1, 1) such that =/(!,!). 

To compute we use (10.35). Thus = e* t = xe' -f y, so that = 7- 

ox ox oz ox xe ~\~~ y 

Notice that we do not claim to solve for z explicitly in terms of x and y. 

THEOREM 10.26. Let F\{x, u, v) = 0, F%(x, u, v) = be two equations 
involving the variables (x, u, v). Suppose that the following conditions 
are satisfied: 

1. FI(XQ, UQ, VQ) = 0, ^2(^0, Mo, VQ) = 0. 

2. FI and F% have continuous first partial derivatives for a neighbor- 
hood of (XQ; MO, ^o). Of necessity FI and F% are continuous. 

3. The Jacobian of FI and F 2 relative to u and v, written 

% 77T *\ JTf 

or i or i 
du dv 



du dv 
does not vanish at (x^ u$, VQ). 
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Under these conditions there exists one and only one system of con- 
tinuous functions 



u = 



such that F(x, <pi(x), <? 2 (x)) = 0, F 2 (x, <f>i(x), <? 2 (x)) s= with UQ = <PI(X O ), 

VQ = ^> 2 (Xo). 

Proof. If ./ 7* 0, then 2 ^ or - 2 ^ 0. Without loss of generality 
on dv 

we take -^ 2 ^ 0. Also we note that J is continuous, so that J 5^ for 



some neighborhood of (x ; WG, *>o), and, moreover, -^- ^ for a neighbor- 

ed 

hood of (XG; UQ, VQ). From Theorem 10.25 we can solve for /> as a unique 
function of x and u, v = /(x, u), such that F%(x, u, /(x, u)) = 0. Hence 



by considering F 2 as a function of .r and u. From F\(x, u, v), v = /(x, u) 
we note that FI is a function of x and ?^, so that 



dFi 

di/ i 



y = constant 



dFi 



, 

constant "i 
= constant 



x = constant i 
v constant 



x = constant 
= constant 



|x constant 



r_ oifVdw 

r- constant ^rr /n 

i -constant L OP %/ OV 



Since 



0, we have 



dF 2 /dv 

ifa **>/(*>")) 



so that from Theorem 10.24 



we can solve FI(X, u, /(x, u)) = for u = <PI(X), <p\(x) unique. Hence 
v = /(x, u) = /(x, ^(x)) = <p 2 (x). Q.E.D. Theorem 10.26 holds if we 
consider FI(XI, x 2 , . . . , x n , u, v) = 0, F 2 (xi, x 2 , . . . , x n , u, v) = 0. 
The proof proceeds in the same manner, making use of Theorem 10.25. 

Example 10.16. If x == f(u, v}, y =* <f>(u, v), then Fi(x, y, u, v) s /(w, v) x = 0, 
F 2 (x, i/, w, v) ss ^>(w, v) ^/ = 0. We can solve for u and v as functions of x and i/ 
provided 

dx dx 

l)u IhT du dv 

d<p d<p 
du dv du dv 



du dv 
dy dy_ 
du dv 



(10.36) 



along with the condition that the first partials of x and y be continuous. If x = r cos 0, 
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y = r sin 0, then 



J ( ^ ) = 



dx dx 

dr d0 

cty dy 

dr d6 



cos r sin 
sin r cos 



For r 7^ we can solve for r and uniquely m terms of x and y in the neighborhood of 
any point o = ro cos 0o, 2/0 = fo sin 9o. Indeed, 

r = (x* -f 2/ 2 )* = tan- 1 ^ -TT ^ < TT 

x 

By mathematical induction one can extend the results of Theorem 
10.26. We state Theorem 10.27 but give no proof. 

THEOREM 10.27. Let f % = f t (xi, x 2 , . . . , x m , ?<i, y>2, . , ^n), 
i = 1,2, . . . , n satisfy the following requirements: 

1. Each / t , i = 1, 2, . . . , n has continuous first partial derivatives 
at the point P(a\, a 2 , . . . , a w , 61, 6 2 , . . . , b n ). 

2. / t = at 7 1 , f = 1, 2, . . . , n. 



3. 



at P. 



There exists one and only one system of continuous functions, 

which satisfy (2) such that <p t (ai, a 2 , . . . , a 771 ) = 6 t , i = 1, 2, . . . , n. 
Example 10.17. For the coordinate transformation 



we note l,hat F t = fi(xi, x< 2 , , x n ) ?y = 0, so that we can solve for x t as a func- 
tion of 7/1, y 2 , . . . , y n provided 



dF\ 







assuming that the first partials are continuous. 

For the system of linear equations y 1 = a l a x a , i 1, 2, . . . , n, we have 



dx> 



Kl 



so that we can solve for x*, i = 1, 2, . . . , n, as a function of y 1 , t/ 2 , . . . , t/ n , provided 
\a]\ * (see Sec. 1.2). 

THEOREM 10.28. Let u t = /,(xi, x 2 , . . . , x w ), i = 1, 2, . . . , n, 
have continuous first partial derivatives at P(xJ, #, . . . ,x). A neces- 
sary and sufficient condition that a functional relationship of the form 

F(UI, u<t, . . . , w n ) s exist is that 



(10.37) 
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Proof. First assume that F(UI, u z , . . . , u n ) = 0. Then 

n 

dF V dF du n 10 

5* ~ 2, a * " J-1,2, ...,n 
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(10.38) 



Since 



= can be looked upon as a linear system of n equa- 



dF dF 

tions in the n quantities, - > a = 1.2. . . . .n, with - ^ for at least 

du a du a 

one a of necessity, 

z*. 

= 

(see Sec. 1.4). Hence (10.39) has been shown to be a necessary condition 
for the existence of a functional relationship involving the u^ i 1, 2, 

dF 

. . . , n. Why is - ^ for at least one a? 

Conversely, let us assume that (10.39) holds, so that 



dx l dx< 2 

a/2 a/_ 2 



dx n 



dfn 



dx n 



= 



(10.39) 



For the sake of simplicity we assume that the minor of ^ does not 

uX n 

vanish, so that 



d/2 <tf* 

dXi 8X2 



3/1 

a/2 

dXn_i 



(10.40) 



Now let 



dx n -i 



1/1 = Ui = /i(Xi, X2, . . . , X n ) 

1/2 = U% = /2(Xi, X2, , X n ) 

n_l = W n -l = /n-l(Xi, X 2 , . . . , X n ) 

/i i ff 

yn ^n 



(10.4 
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From (10.40) and (10.41) we have 

a/i 



j (y\, y*, > y*\ = 

\Xi, Xt, . . . , X n ) 



a/i 
6x2 
a/2 



dfr 

dx n 



df n 







1 



From Theorem 10.27 we can solve for the X T , i 1, 2, . . . , n, as func- 
tions of the ?/, , z = 1 , 2, . . . , n, so that 

Xi = ? t (2/i, 2/2, - - - , 2/) i = 1, 2, . . . , n (10.42) 

and U n = / n (Xi, X 2 , . . , 3n) = ^(2/1, 2/2, - , 2/n) 

along with Ui = 1/1, w 2 = 2/2, . . , ^n-i = 2/n-i [see (10.41)]. From 



/ 1 " ~ - - -- I \x x 


Vn) \J/1, 2/2 1/n 


(see Sec. 1.2) we have 






1 
1 









dF dF dF 

dyi dy$ dy% 


= 
dF 



dF 
so that T = and w n = 



Thus 



1/2, 



0. (? is the 



functional relationship predicted by the theorem. Q.E.D. 

Example 10.18. Consider u - x + y + z, v - x y z, w = 2x. Then 

1 1 1 
1 -1 -1 
200 

Tt is obvious that G(u, v, w) **u+v wzzQ. 

Problems 
1. Let x m r sin cos $>, y = r sin 6 sin <p t z r cos 0. Show that 



u, v, w\ 
*> 2/> z / 



2. In Prob. 1 solve r, 0, ^> in terms of x, y, z. 
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3. Let u = xyz, v =* xy + yz + zx, w *** x + y + z. Show that 
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4. For J j& in Prob. 3 solve for . Hint: Assuming that we can solve for x, y, z 

du 

in terms of u, v, w since / 7* 0, we have, upon implicit differentiation with respect to w, 

/i/l. ,1l- .A/JJ /)/9 



du _ _ dx 

Thi ~ l ~ yz 

dv , 

-. O . (v 

dw __ _ dx 
_ _ _ __ 



xy ~ 



. dx . , 
g )- + ( 

<)?/ dz 
__ -!___ 



. dy 
)- 



dz 
)- 



Solve this linear system for 

du 

6. Let u = x -f ?y -f 2, ^ = x?/ + ,//2 -f- 



2/ 2 + 2 2 - Show that 



Find a functional relationship between u, v, w. 

6. Consider a function of (x 1 , rr 2 , a* 3 , . , .r M ), say, /(a; 1 , x 2 , 
the Hessian of F by 



. . , x n ). We define 



ay 



Under the coordinate transformation r a = 
. . . , y n ). Show that 



the function / becomes F(y l t y z , 



dy 1 



dx 1 



10.16. The Riemann Integral. Let us consider any bounded function 
f(x) defined over the range a ^ x ^ b. f(x) need not be continuous, 
but we desire \f(x)\ < A. We subdivide the range (a, 6) into n arbi- 
trary subdivisions, 



a = #o < x\ < 
and form the sum 



< X n = 



(10.43) 



where M % is the supremum of f(x) for the interval x % -i ^ x ^ x t . The 
sum S n obviously will depend on the manner in which we subdivide the 
interval (a, 6). The infemum of all such sums obtained in this manner 
is called the upper Darboux integral of /(#), written 



& 



(10.44) 
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Since S n <= A (b a) , the inf emum S exists for all bounded functions. 
Similarly, we can form the sum 



s n = m^x, - ov_i) (10.45) 



with m ^ the infemum of f(x) for the interval x % -\ ^ x ^ av_i. The 
supremum of all such sums obtained in this manner is called the lower 
Darboux integral of /(#), written 

x)dx (10.46) 

The reader can easily verify that S ^ S. 

If S S, we say that /(x) is Riemann-integrable (7^-integrable) and 
we write 



= l b f(x)dx (10.47) 

Ja 



We now investigate some conditions for which f(x) is jft-integrable. 

1. Let/(;r) = constant = c. Then 

n n 

S n = Y c(x t ov_i) = c Y fa Xi-i) = c(fe a) 

t=l T-l 

6' n = c(b a) 
so that S = 8 = c(b a) = I cdx. 

Ja 

2. Let/(z) = x for ^ x ^ 1. We consider the subdivision < l/n 
< 2/n < - - < i/n < < 1. Then 

S -^ i ( i i - l \- 1 Vf-*(n + l)_l/i | 
n ~ " 2 2^ 2 V ^ 



The infemum of all sums obtained by equally spaced divisions is seen to 
be ^. Similarly 



The supremum of all such sums is . Let the reader prove that S = S = -g- 
for all manners of subdivisions (not necessarily equally spaced), so that 

1 x dx = ^. 

3. A continuous function is always R-integrable for the finite range (a, b). 
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Proof. Since f(x) is continuous, it is uniformly continuous for 

a ^ x ^ b 

Hence for any > there exists a finite subdivision of a ^ x ^ 6 such 
that the difference between the supremum and infernum of f(x) on any 
interval of the subdivision is less than e/(6 a). This yields 

\S n ~ s n \ < (b - a) r-^~ = 6 

d 

If necessary we subdivide further so that |*S $ n < e, |& Sn| < e. 
Why can this be done? One must recall the definitions and properties 
of the supremum and infemum. This is left for the reader. Hence 
\S S\ < 3e. Since e can be chosen arbitrarily small, of necessity 
S S = 0, = . Q.E.D. It must be remembered that *S and are 
fixed numbers. 

4. Any bounded monotonic increasing or decreasing function is R-iute- 
grable (see Prob. 10). 

5. An example of a function which is not /Mntegrable is as follows: 
f(x) = 1 for x irrational, f(x) = for x rational, ^ x :g 1. Let the 
reader show that 1, S 0. 

We list some properties of the Riemann integral. It is assumed that 
the integrals under discussion exist. 

(1) fcf(x)dx = c [ b f(x)dx 

Ja Ja 

(2) f a b f(x)dx= - ff(x)dx 

(3) f b f(x) dx = r f(x) dx + f b f(x) dx 

Ja Ja Jc 

(4) lif(x) ^ 0, then f b f(x) dx ^ for b ^ a. 

Ja 

(5) [ " (f(x) + <p(x)} dx = ff(x) dx + ( b *(x) dx 

Ja Ja Ja 

(6) [ b \f(x)\ dx ^ f b f(x) dx,b^a 

Ja Ja 

(7) [f a b v (x)f(x) dx] 2 ^ f' <p\x) dx f a b f*(x) dx 
Proof. 

l b \fr,(x) + /(x)] 2 dx = X 2 f b <p\x) dx 

Ja Ja 

+ 2X f b <p(x)f(x) dx + ( b p(x) dx ^ 

Ja Ja 

for 6 > a. If y = AX 2 + 2J5X + C ^ for all real values of X, then 
+ 25X + C = has either two equal real roots or no real roots. 
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Of necessity B 2 - 4 AC ^ 0, which yields 

[f* <p(x)f(x) dx] 2 g I* <p*(x) dx f' / 2 (x) dx (10.48) 

(8) / f(x) dx ^ M\b a|, M = suprenium of f(x) for a fg x g 6 

Ja 

(9) r/ie Firs/ Theorem of the Mean for Integrals. If /(x) is continuous 
on the range a ^ x ^ ft, then 

[ b f(x)dx - (6 - a)/(f) a g ^ fr 
y 

The proof of (9) is left as an exercise for the reader (see Prob. 8). 

(10) ['f(x)dx = 

Ja 

We should like to emphasize that, if f(x) is /t'-integrable on the range 
(a, 6), we can write 

/ = f b f(x)dx = f b f(t)dt 

Ja Ja 

since the value of the integral does not depend on the variable which is 
used to describe /(x) . Of course 7 depends on the upper arid lower limits, 
b and a, respectively. Moreover / does depend on the form of f(x). 
Once f(x) is chosen, however, we need not use x as the variable of inte- 

gration. If /(x) is 72-integrable for the range (a, 6), then \ f(f) dt exists 

Ja 

for a ^ x ^ 6. Each value of x determines a value of I f(f) dt, so that 

Ja 

a single- valued function, F(x) t can be defined by 

F(x) = f'f(t)dt (10.49) 

Ja 

We could write F(x) = / f(x) dx, but confusion is avoided if we use 

Ja 

(10.49). From (10.49) we Have 

F(.r + A.r,) = r^ X f(t)dt 

Ja 

and 

F(x + Ax) - F(x) = [' + "f() dt - r/(t) dt 

Ja Ja 

dt (10 * 50) 



Since /(.r) is bounded for a g x ^ 6, we have 

|F(x + Ax) - F(x)| ^ ^. Ax (10.51) 

Hence F(x) is continuous since F(x + Ax) > F(x) as Ax ~> 0. 
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From (10.50) we note that 

F(x + Ax) - F(x) = /() Ax x ^ ^ x + Ax 
[see (9) above], provided /(x) is continuous on (x, x + Ax). Hence 

dx AS-+O Ax AT-+O 

.. ff(f)dt = f(.r) (10.52) 

"X 7 a 

We can obtain (10.52) without recourse to the law of the mean. We 
have 

W(v I AT*"| F(^\ 1 /*z-f Az 

* \** x 1^ LA^/y l \Js) j>/ ,_\ A / //j\ jj A / r/ \ ^j 



x + Az 

[/(O - /(x)] A 

If f(f) is continuous at t = x, we have \f(t} /(x)| < for \t x\ < 8. 
Choose | x| < 6 so that 



F(x + Aa;) - F(x) 



- /(a-) 



1 



r 



Ax 
Since c can be chosen arbitrarily small, we note that 

F'(x) = lim 



We obtain the fundamental theorem of the integral calculus as follows : 
Let G(x) be any function whose derivative is/(x), /(x) continuous. Then 
(?'(x) = F'(x) for a g x ^ 6. From Prob. 8, Sec, 10.13, we note that 
F(x) = G(x) + C, C = constant. Hence 



For x = a we have G(d) + C = so that 
G(x) ~G(a) = 



For x = 6 we obtain 

G(b) -G(a) = ( b f(x}dx (10.53) 

ya 

Hence, to evaluate / /(x) dx, we find any function G(x) whose deriva- 

Ja 

tive is f(x) ; the difference between G(b) and G(a) yields / /(x) dx. 

Ja 
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Problems 

n n 

1. From (k + I) 2 - fc 2 2k + 1 we have Y K* + *) 2 - Afl - w + 2 Y ** 

n 

that V k = ^[(n + I) 2 - 1 - n] = w(w + 0/2. Consider 
(A; + I) 3 - /c 3 - 3A: 2 + 3A: + 1 



so 



and show that fr 2 - n(n + l)(2n -f 1) 

*-i 

2. Without recourse to the fundamental theorem of the integral calculus show that 

/ x 2 dx = -g by making use of the result of Prob. 1 

3. If /(x) is continuous for a ^ x ^ 6. b > a, and if / f(t) dt = for all x on 

Ja 

(a, b), show that/(.c) ^ 0. 

4. If f(x) is continuous on (a, 6), b > a, and if, for any continuous function ^(rr), 

fb 

/ /(j-)^(o-) ^ = 0, show that /(a-) = on (a, b). 

Ja 

6. If /(x) is continuous for a ^ x ^ 6, /> > a, and if /(.r) ^ on (a, b), 

f b f(x)dx = 

Ja 

show that/Cc) = on (a, b). 

6. Let /(a;) = 1 for ^ x ^ 5, /(.T) = 2 for ^ < x g 1, and consider 



^ a? ^ 1. Obtain a graph of FOr). Why does F'(x) not exist at x = ^? Does this 
contradict (10.52)? 

7. Consider the interval ^ x f L Define /(x) as follows: /(x) = if x is irra- 
tional, /(x) = I/? if x = p/(?, p and g integers in lowest form. Show that /(x) is 
72-integrable on ^ x ^ 1. 

8. Prove the first theorem of the mean for integrals. If /(x) is continuous on (a, b), 
<p(x) 72-mtegrable on (a, 6), and <p(x) of the same sign on (a, 6), then 



( b /(x)*>(x) dx - /( ^ *>(x) rfx a ^ ^ 

Ja Ja 

9. If /'(x) and ^(z") are 72-mtegrable, show that 

f b f(x)<f>'(x) dx = /(x)^(x) 6 - r vW(x) dx 
Ja a Ja 



- /(a)^(a) - f b v(x)f\x) dx (10.54) 

Jo 



Equation (10.54) represents an integration by parts. 

10. Prove that any bounded monotonic function is -R-integrable. 
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10.17. Integrals with a Parameter. Let us consider the integral 

F(t) = f b f( x , t)dx U ^ t ^ ti (10.55) 

Ja 

After f(x, i) is integrated with respect to x, what remains depends on the 
parameter /, t Q ^ i ^ t\- Of course F(t) depends also on a and b, but for 
the present we assume that these numbers are fixed and finite. For 
example, 



is a well-defined function of t for t > 0. 

We now determine some conditions for which F(f) will be continuous 
We desire 

\irnF (t) = F(t) = l b f(x, t} dx (10.56) 

t-St Ja ' 

If - exists and is a bounded /f-mtegrable function with respect to 
at 

x for U ^ i = t\, then 



b 



[f(x,t) -f(x,l)]dx 



= \t-t\ 



^ M\t - t (10.57) 



by applying the law of the mean to/(x, t) /(#, t). From (10.57) we see 

that lim F(t) = F(t). A less stringent condition which does not imply 

t->~t 

the existence of -^- and which enables us to prove the continuity of 

F(t) is as follows: Let/(x, t) be uniformly continuous in t with respect to x 
so that for any e > there exists a 6 > 0, 5 independent of x, such that 
\f(Xj t) f(x, T)\ < e whenever \t t\ < 6. One notes immediately that 

\F(t) - F(t)\ ^ ( b \dx\ = e\b - a\ 



Since c can be chosen arbitrarily small, we see that F(i) > F(t) as t > t. 
Of more particular interest is the possibility of showing that 

dF(t) = ( 
dt ~ e 

Let us assume that - ' is continuous for a ^ a: g 6 ; / ^ ^ ^ <i- We 
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define 0(t) by 



f f f b df(x t} f b f 1 

Then / G(t) dt = / / J \! J dx dt = I / 

Jto Jto Ja O* 7a ./Jo 

P 

= / [M - /(*, *o)] dx 

= F(0 - F(i ) (10.59) 

The interchange of the order of integration can be justified since the 
integrand is continuous. From (10.52) we obtain 

dF(t) = . 
c?^ ~( 

Example 10.19. We consider 

F(t) = j *">** (is \t\ < T (10.60) 

It is obvious that ~- (e t(>nn:r ) = cos x f' tosj \Q continuous in x and t for ^ x 2S TT, 

|<| < 2 T , so that 

^ /v 

-j7 / cos x e 1 COB x dj 

dt Jo 

It is easy to justify a further differentiation so that 

d*!F_ _/""., (oos*^,. 
rf? "" Jo C S a;e 8a! 

F(t) / sin 2 x e* 08 * dx 

F(t) + /sin xd ( T e to BX ] 
Jo \< / 

- 1 / , 

7 I cos x e* CU8 x dx 
-o * Jo 



Bessel's differential equation (see Sec. 5.13) for n = is -j^ -f - -j^ -f w 0. If 

a z 2 oz 

z = i<, we have -p -f - -^- w? 0, which is satisfied by F(t) of (10.60). 

Problems 
1. Show that 

dx 



f 1 

J ( 

/6 /"6 

cos /x dx (1/0 (sin 6< sin a<) compute / x 2 '' cos to (/x, p a posi- 
1 JO 

tive integer. 
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3. Show that 

sin x dx _ 2 < a 1 



~ a cos a; + a 2 

a 



4. Given F(t) = f ^ /(*, dt, show that 
y*(fl 



for certain rost notions on -l^-'-- < ^(/) ? ^(/). 
5. By two methods show that 

J f' 2 (2x + dx - 4< 3 4- 3/ 2 - 4/ 

10.18. Improper and Infinite Integrals. The function f(x) = or*, 
a: > 0, /(O) = 1, is well defined on the interval ^ x ^ 1. Since /(x) is 
not bounded on ^ x ^ 1, we are not certain that it is /Mntegrable on 
this range. However, for < e ^ x g 1, we have 



As e > 0, F(e) > 2 and we define 

r^. = lim r * = 2 

Jo V% r:2 ^ v * 



We notice that /(O) could have been defined in any way we please with- 

ri ( { x 
out affecting the value of the improper integral / - Let the reader 



f 1 dx 

show that the improper integral / fails to exist. 

Jo % 

A Practical Rule. Let/(x) be #-integrable for the range a + e ^ x g 6, 
for all c > 0. Define \l/(x) by 

1. If for 11 < 1 we have \\l/(x)\ < M = constant for a x ^ b, then 
lim / f(x) dx exists. 

e-0 ^ a + 

>0 

2. If f or jit ^ 1 we have \$(x)\ > m > for a g x ^ f), then 

lim f /(x) dx 
fails to exist. 
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These results are left as an exercise for the reader. One easily sees 
p fa 

that lim / -, -r- exists if \i < 1 and fails to exist if /u > 1. 

-o J a+t (x - a)* 

If f(x) is continuous for a ^ x ^ b except at x = c, a < c < 6, we define 
f b f(x) dx = lim I C ~ ( f(x) dx + lim f b f(x) dx (10.62) 

Ja _> J a d ./ c + 5 



(5-0 
e>0 &>Q 



provided these limits exist. It is important that e and 5 approach zero 
independently. 

r a Anf 
Example 10 20. We consider / -r=^=- The upper limit may give us trouble. 

Jo -y/a 2 x 2 

However, f(x) = (a xY *(a -f .r)*, so that <//O) = (a x)*f(x) = (a + x)* is 
bounded for ^ x ^ a. Since M = i the integral exists and, actually, 

/"a dx _ TT 
Jo Va^^P ~ 2 

The integral / f(jr) dx is called an infinite integral and is defined as 

Ja 

follows: Let/(x) be 72-intograble for a g x g X, X arbitrary. We define 
[~f(x)dx= lim [*f(x)dx (10.63) 

Jo X -* J a 

provided this limit exists. For example, 



r ^ r P 

/ i , o = lim / r 
yo 1 + x- x-+ 7o 1 



dx v j v TT 

- 2 l im tan" 1 A = - 



Let the reader verify that / cos x dx does not exist. If / \f(x)\ dx 

70 Ja 

exists, we say that f(x) is absolutely integrable. 

From (10.03) \ve note that for any e > an XQ exists such that 



I ^ f(x) dx - P/Cr) dx 

I Ja Ja 



< for X 



x 



or ^ f(x) dx < e for X ^ X (10.64) 



One sees immediately that if / \f(x)\ dx exists then \ f(x) dx exists, 

Ja Ja 

since 

\[~f(x)dx f"\f(x)\dx 

I J -A- J -A. 

A simple test for the convergence or existence of / * f(x) dx is as 

Ja 

follows: If a function <p(x) exists such that / \<p(x)\ dx converges, and 

Ja 
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if \<p(x)\ > |/Cr)| for x ^ a, then \ f(x) dx converges. We leave the 

Ja 

proof of this statement as a simple exercise for the reader. 

Another simple test for the convergence of an infinite integral is based 
upon the integration-by-parts formula. From 



u(x) dv(x) = u(X)KX) - u(a)v(d) - P t>(x) du(x) 

Ja 

we note that, if lim u(X.)v(K) exists and if / r(x) du(x) exists, then 
x- Ja 

/u(x) dv(x) exists. 
i 

Example 10.21 We consider / dx. The origin need not concern us since 



, sin x t , r , 

lim = 1. We have 



f X sill r /* X I COS X /" X COS x , 

I dx = I - a ( cos x) ^r / dx 

Jir/2 X JTT/2X X J r /2 X* 

TWT i- cos X ,. , , /"X cos x , , /" dx ~. 

Now lim ^7 = arid lim / --- dx exists since / r exists, omce 

X-+co X X -> Jir/2 X 2 JTT/2X 2 

/W2 S m x . 

Jo ** 

exists, we see that / dx exists Its value is 
Jo x 2 

We now consider an infinite integral involving a parameter. Let <p(t) 
be defined by 

^(0 ~ / f( x i t) dx to ^ t ^ ^ (10.65) 

We assume that the integral of (10.65) exists for each value of t on the 
range U ^ t ^ t\. If we fix our attention on a specific value of t, we have 
for any e > an XQ such that 

/ v /(x, ds < X ^ X (10.66) 

I yX 

The value of Xv (for a fixed e) may well depend on t. If, for any e > 0, 
there exists an XQ independent of t such that (10.66) holds for all t on the 
range o == t ^ <i, we say that the infinite integral converges uniformly 
with respect to t. 

Let the reader show that if g(x) ^ \f(x, t)\ for t Q ^ t ^ fa, and if 

/g(x) dx exists, then / f(x, t) dx converges uniformly, 
i Ja 

Example 10.22. We consider F(l) = / e~* 2 cos xt dx. Since e~ x * ^ \e~ x * cos 
for all t and since / e~ x * dx \/ir/2 exists, F(0 exists for all <. 



xt\ 



430 
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We now show that if f(x, t) is uniformly continuous in t for a rg x X, 
X finite but arbitrary, then <p(t) of (10.65) is continuous in t provided 

f(x, t) dx converges uniformly. We have 



/ 



, t 



[f(x,t + U) -f(x,t)]dx 

+ / x " /(*, i + A<) dx - /(x, dx 
From uniform convergence we have for any e/3 > an X such that 



f(x, t + M) dx 



f(x, t) dx 



From Sec. 10.17 we note that for any e/3 > a d > exists such that 

[/Or, * + A0 - f(x, 0] cte 
for |A| < 5. This is the property that / f(x, t) dx is continuous in /, 

Ja 

X finite. Hence 

\<p(t + A) <p(0l < 6 

for |A| < 5, d > 0, so that <p(t) is continuous. Q.E.D. 

It is interesting to find a sufficient condition which enables one to write 



dx 



Let the reader first show that, if G(t) = / * 

7a ot 

formly, then 

/oo /*< 
I /v > ^^ ^ 
./fc ^ 



(10.67) 



converges uni- 



G(0 ^ = 



(10.68) 



provided 



^ 



is continuous for a; ^ a, < ^ < ^ <i [see (10.59)]. Dif- 



ferentiation of (10.68) leads to (10.67). 

Example 10.23. We consider F(t) given in Example 10.22. From 

/(x, t) e-* 2 cos 2x< 

we have ~ 2x6"** sin 2a;<, To show that / - - - dx converges uniformly for 

vt JO dt 
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all t, we note that 



I - dj(xt) dx i 

yo d jo 



L Hence 



2:ce-* 2 sin 
o 



r oo 

= / sin 2xJ rfe- x2 = ~2/ g" xl cos 
yo 

= -2lF(t) 



Integration yields F(t] Ae~ iZ = / e~* 9 cos 2xt dx At t = we have 



/o" e ~" ' 



so that 

e~ x * cos 2xt dx 
Problems 



/i I aH* 
^ - - dt exists for ^ a < 1 

O SJKnw that I 7=r_: 1 exists 

*' bh W ^^ JO V(l ~ T 2 )(l ~ frX) 

/7T/2 

3. Show that / In sin x dx exists 

Jo 

r 

4. Show that 7r/2 - tan" 1 t = / (1 //>-'* sin x dx, t ^ 

/oo COS .C 
___ ^j. converges uniformlv for all/. Evaluate this integral 

by contour integration. 

6. Show that / x sin e x dx = / xe~ x d( cos P*) exists. 

Jo Jo 

7. Let /(JT, = xt. Show that 

*i 



/o" /-i f(x > * ^ 
. Consider F(y} I - ~ dx, and show that 



10.19. Methods of Integration. We first consider the special indefinite 
integral 



In the elementary calculus the reader was told to expand (.r 2 + I)" 1 
(x 2)" 1 in partial fractions, obtaining 

1 Ax + B C 



(x 2 + l)(:r - 2) x 2 + 1 ' x - 2 

(A + C)x 2 + (B - 2A)x + C -2B 



9 , (10.70) 

X L) 
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Since (10.70) holds for all x, one has A + C = 0, B - 2A = 0, 
C - 2B = 1, so that A = -|, B = -|, C = |. The integral (10.69) 
can be written 

dx__ 
2 



/dx _ I f x dx 2 f dx , 1 f dx 

( X 2 _i_ f )TjT _ 2) ~~ ~~ 5 / x 2 + 1 """ 5 / x 2 + 1 5 / x ~ 

= - ^r In (x 2 + 1) - \ tan" 1 x + ^ In (x - 2) 
lu o o 

+ constant 

The results of Sec. 8.11 justify the expansion (10.70). Since x 2 + 1 and 
x 2 are relatively prime (there is no polynomial of degree ^ 1 which 
divides both x 2 + 1 and x 2), there exist two polynomials P(x\ Q(x) 
such that 



Thus P(x) = C, Q(x) = Ax + B, and (10.70) results upon division by 
(x 2 + l)(z 2). To evaluate the integral 

x cto 



(.T 2 + 1)(X - 2) 

one notes that 

x 1 x 2 2 x , 1 



_ __ ___ 

(x 2 + l)(x"- 2) ~ 5 x 2 + 1 5 a: 2 + 1 " 5 a: - 2 



x - 2 
so that 

T C!T 

-Z = i tan-' x - $ In (x* + 1) + f In (z - 2) + C 



If the zeros of a polynomial P(:r) are known, one can evaluate 

$$** (10.71) 

provided Q(x) is a polynomial. One writes 





A i w ^ i . . . j_ * ~i~ . . . j_ 

P(x) " r^ + ^=7; + + T=T k + + ^~=T n 

with a k = lim p( * fc = 1, 2, . . . , n. One then divides Q(z) by 
x r k and performs a series of simple integrations for k = 1, 2, . . . , n. 
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If P(x) is a polynomial such that x = r is a A;th-fold zero of P(x) 
we have 

P( X ) = (x - r)*Q(oO Q(r) * 
Thus P'(x) = (x - r^-WOr) + (x - r)Q'(z)] 



and P'(r) = provided k > 1. For fc = 1, P'(r) ^ 0, so that P(x) and 
P f (x) have no zeros in common if P(x) has no multiple zeros. In this 
latter case P(x) and P'(x) are relatively prime so that polynomials A(x) 
and B(x) exist such that 

A(x)P(x) + B(x)P r (x) = 1 (10.72) 

To evaluate an integral of the type 



J 



provided P(x) has no multiple zeros, we make use of (10.72). Thus 

C(x) A(x)C(x) B(x)P'(x)C(x) 
[P(x)] (P(x)]<-i "" [P(xf]-~ 
and 

f C( - x} i r = f A^^) rfr + __J ^KI(?) 

J fP(x)]" J [P(x)]"- 1 "*" 1 -"m [P(x)]"-' 



Since m has been reduced by 1, we can repeat this process until integrals 
of the type (10.71) occur. 

Example 10.24. We consider 

/ (X ) - f z dt - f z dt - f 1 / 2 ) " 2t di 
" (x> ~ Jo (< 2 + l)" ~ Jo (< 2 + l)"- 1 Jo (< 2 + l)" 

since 1 (V + 1) (</2)(2<) = 1. Integration by parts yields 

(t* + 1)'-" I* ^ f* dt 




- 2(n - 1)( + I)-- 

From /i(x) tan" 1 x one has / 2 (z) = ^ tan" 1 x + ^x/(x z + 1). Repeated applica- 
tion of (10.75) yields 

T i \ 1 3 5 (2n - 3) . . , , v 
^ W - 2-4.6.. . (2n-2) ^^ * + R(X) 

where /2(x) is a rational function of j. 
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If F(x) is a polynomial, it can be written in the canonical form 

F(x) = X,X\ X 

with X\j X%, . . . , X m , polynomials relatively prime to each other. 
Since X\ is relatively prime to X\ - - - X%, we have 

A(x)Xi+B(x)Xl X = I 

.. . B(x) , A(x) 1 

so that ~- + ^ - 



Y Y 2 Y3 . . . Y m Y Y 2 Y m 

*\ i ^\ 2 ^ 3 ^ m ^ 1^ 2 A w 

Continuing this process by noting that X\ is relatively prime to X\ 
X% y one can write 



F(x) X, ' X\ ' ' XZ 

if f(x) is also a polynomial. The evaluation of 

dx 



F(x) 

is reduced to an evaluation of integrals of the type given by (10.73), 
which in turn can be reduced to the simpler form given by (10.71). 
Let R(x, y) be a rational function of x and T/, 



, y) - '~^r (10.76) 

2 2 MV 

y_0 t-0 

To evaluate 

TB) do: (10.77) 



with y = \/Ax + #, one notes that the change of variable t 2 = Ax + B, 
dx = (2tdt)/A, x = (t 2 - B)/A, reduces (10.77) to an integral of the 
type discussed above. 

Example 10.25. To evaluate 

/ ~^f^ dx (10J8) 

we let P - x 4- 1, 2< dt - dx, so that (10.78) becomes 



aad / V*i ^.2x^+1+ to ^1^-1+ C 

* Vx + 1 + 1 
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We can evaluate 



fR(x, V^nr^+~c) dx (10.79) 

provided R is a rational function. Let x = a, y = satisfy 



so that # 2 == A a 2 + Ba + C. The equation of the straight line through 
the point (a, ft) is y ft = t(x a.) with / the slope. The intersection 
of this straight line with the curve /y 2 = Ax' 2 + Bx + C is easily seen to 
yield the coordinates 

(B + 2Aa - 20t)t (1 ' 80) 

y = ft + 2-_L~4 

Since x, T/, and da: are rational functions of t, one can integrate (10.79) 
by the methods discussed above. 

Example 10.26. We evaluate / -^=^. We have ?/ 2 * x 2 + p 2 , A - 1, 

/ x v x 2 + p 2 
B ** 0, C = p 2 , and we choose so that & = p. From (10 80) 



2pt 1 + / 2 

'-i-7. -P!--/i 

so that f -^L^ - ] [ dt ! - ! In -P ^ ^ i >"P i + r 

y x \/x 2 + p* P J t p * 

The integral (10.79) is a special case of the following type of integral. 
Suppose R(x, y) is a rational function of x and ?/, with y depending 
implicitly on x through the relation f(x, y) = 0. If the curve f(x, y) = 
is unicursal in the sense that we can describe the curve parametrically by 
x = <p(t), y = \l/(t) with v(t) and \(/(t) rational functions of t, then 

JR(x, y) dx = fR(v(t), WW() dt (10.81) 

and this integral can be evaluated by the methods discussed above. 

A rational function of the trigonometric functions can be integrated 
by the use of the following change of variable : 



. X /I COS X 

2 = tan - = 



" rrr^ (10-82) 

2< 
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For example, 



The elliptic integral of the second kind appears in a natural manner 
if one attempts to find the arc length of an ellipse. Let x a sin <p, 
y b cos <p, ^ v? < 2ir be the parametric representation of an ellipse. 
Then ds 2 = dx 2 + dy* = (a 2 cos 2 <p + b 2 sin 2 <p) d<p 2 , so that 



L = a / \/\ e 2 sin 2 (p d<p 



withe 2 = (a 2 6 2 )/a 2 < 1 if 6 2 < a 2 . The elliptic integral of the second 
kind is denned as 



E 2 (k, <?) = j \/l ~ W sm T 7 d<p \k\ < 1 (10.83) 

The period of a simple pendulum leads to the elliptic integral of the first 
kind, given by 



sn 



|fc| < 1 (10.84) 



Extensive tables can be found in the literature for the evaluation of these 
important elliptic integrals. 

Problems 

Integrate the following: 




for a > 0, a < 

-f bx -f c 



. /" , ^ - f or a i > 52 a 2 < 52 
] a -\- b cos cz 

y a sic 





sin ex + b cos cz 

10. Let a and be real roots of Ax* -f Bx + C - 0. Show that 



+ Bx + C - (a? - ft) \!A : 

3: 
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and that t = ^A -~~ defines a: as a rational function of t. Apply this result to 




evaluate / \/x*~~- 3x + 2 dx. 

fir/2 

11. Let I m = / sin x dx. Show that 

yo 



L2P (2p)M 2 Jlptl (2p + l)!l 

with (2p)t =246 (2p), (2p + 1)!! = 1 - 3 5 (2p + 1). 
10.20. Sequences and Series. A sequence of constant terms 

Sl, S 2 , S3, - - , S n , . . . (10.85) 

is simply a set of numbers which can be put into one-to-one corre- 
spondence with the positive integers. To completely determine the 
sequence, we must be given the specific rule for determining the nth 
term, n = 1, 2, 3, . . . . We may also look upon the sequence, {s n }, 
of (10.85) as a function defined only over the integers, so that/(n) = s n , 
n = I, 2, 3, .... 

DEFINITION 10.26. The sequence of terms {s n } is said to converge if a 
number S exists such that 

lim s n = S (10.80) 

n * oo 

Equation (10.86) states that for each e > there exists an integer AT(e) 
such that |iS Y s n \ < t for n ^ N(e). In general, the smaller e is chosen, 
the greater is the corresponding N. The existence of an integer jV for 
the given e implies that the sequence becomes and remains within an e 
distance of S. 

Example 10. 27. It is easy to show that the sequence 2, l-J, 1-J-, . . . , 1 + 1/n, . . . 
converges to the limit 1. We have |.s w - 1| = |l/n| < eif n > 1/c, so that N(c) = [l/], 
where [l/e] is the first integer greater than l/. 

In most cases we cannot readily determine the limit of the sequence 
even though the sequence converges. Cauchy obtained a criterion for 
the convergence of a sequence which does not depend on knowing the 
limit of the sequence. 

Cauchy' s Criterion. If, for any e > 0, an integer jV exists such that 
\s n+p s n \ < e for n ^ N and all p ^ 0, the sequence [s n \ converges to 
a unique limit S. 

This criterion is certainly necessary, for if lim s n = S, then \S s n \ 

n *< 

< /2 for n ^ N, \S - s n+p \ < e/2 forn S N, p ^ 0, so that \s n+p - s n \ 

< e f orn ^ N, p ^ 0. Conversely, assume that a given sequence satis- 
fies Cauchy's criterion. Choose e = 1, so that \s n + p s n \ < 1 for n ^ NI, 
p ^ 0. Hence \SN I+P sjyj < 1 for p ^ 0, and all terms in the sequence 
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following SN, are within a unit distance of SN^ There are only a finite 
number of terms preceding SN V so that the sequence must be bounded. 
From the Weierstrass-Bolzano theorem at least one limit point exists for 
the infinite set of numbers {s n }. Let us assume that two limit points 
exist, say, S and T, S < T. We choose e = (T - R)/2 and note that 
|.$ n +p $ n \ < for n *z N, p ^ 0. This means that after SN the terms 
of the sequence are clustered within an e distance of each other. Let the 
reader conclude that, if S is a limit point of the set {s n j, then T cannot 
be a limit point, and conversely. Thus Cauchy's criterion guarantees 
that the sequence has a unique limit, S, with Km s n = S. It should be 

n > oo 

understood that by first considering e = 1 we showed that the sequence 
was bounded. Then we deduced that only one limit point could exist. 

Example 10.28. If we return to the sequence of Example 24, we note that 

. .P 



n(n + p) 



n 



if n > l/, p ^ 0, since p/(p -f n) < 1. Again N [1/e]. 

By applying the Weierstrass-Bolzano theorem it is very easy to prove 
that a bounded monotonic nondecreasing sequence of real numbers always 
converges. As an example of the use of this result, we consider the 
sequence 

n = 1,2,3, ... 

We show first that the sequence is bounded. We have from Newton's 
binomial expansion 



- JL - - ~ 2) 1 4- -L 1 

Sn ~ l + U n) + 2!~ ^ + 3! ^ + " ' ' + ^ 

- 1 I 1 } l-l/*.i:(l-l/n)(l-2/n) , .... 1 

- l -h H- - 2! + 3| + -h - 

so that 
5n <l + l+^ + ^+ <l + l+| + ^ + ^+ '- =3 

Every term of the sequence is less than 3. Next we show that s n > s-i, 
so that the sequence is monotonic increasing. First let us notice that, 
if a. and ft are any two numbers such that a > ft ^ 0, then from 



we obtain 
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Now let a = 1 + l/(n - 1) > ft = 1 + 1/n, n > 1, so that 

fi n > a n - na n ~ l (a - ft) = a^a - n(a - ft)] 
yields 

S n = ( 1 + -) > _i ( 1 + -U: - ~~ T ) = _! n > 1 

\ n/ \ n 1 n I/ 

Thus the sequence converges to a unique limit, called e. 



lim ( 1 + - J = e = 2.71828 
n-+ \ n/ 



The infinite series 

Ui + U 2 + ' ' + U n + ' (10.87) 

with u n well defined f or n = 1, 2, . . . can be given a clear meaning if we 
reduce the series to a sequence. We define Si = HI, s 2 = HI + HI, . . . , 

i. n = 1, 2, 3, ... (10.88) 

If the sequence { s n } converges to a unique limit $, we say that the series 
converges to S and write 

00 

S = y w, (10.89) 

If the sequence fails to converge, we say that the series diverges. In 
general, one of three things will occur when we consider the convergence 
of a series: 

1. The series converges. 

2. The series increases beyond bound, and so diverges properly. 

3. The series oscillates and does not converge. 

00 

Example 10.29. The series f m j , m < 1, converges to the limit 1/(1 m) since 



8n ) m' - (1 - ra +1 )/(l - wi) -> 7j x as w-> oo. The series 1 + 1 -f 1 

^ (1 m) 

j-0 

_|_ . . . _|_ i _|> . . . increases beyond bound since s n = n. The series 1 1 + 1 
I _j_ . . . _j_ (_i)n+i _|_ . . . does not converge since ^[1 -f ( l) n+1 ] fails to 
converge. The terms of the sequence oscillate between 1 and zero. 

n 

If we apply the Cauchy criterion to the sequence s n = ^ u iy n = 1, 
2, . . . , we note that a necessary and sufficient condition for the con- 
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vergence of the series y u 3 is that for any c > an integer N exists such 
that 

" u, < e (10.90) 



n-\-p 



for n ^ AT, all p > 0, since s n + P = > u jy s n = Y ?v Formula (10.90) 

;=1 jI 

is equivalent to the statement that for any e > an integer N exists 



< e for n ^ A 7 " 



such that 
If we write 



.7 = 1 J = l 

with R n y Uj, we note that the series converges if for any c > 



we can find an integer Af such that all remainders, R nj are less than e 
for n ^ N. 

If we apply (10.90) for a convergent series with p = 1, we note that 
for any e > an integer N exists such that \u n+ i\ < e for n ^ N. Thus 
a necessary condition that a series converge is that the nth term of the 
series tends to zero as n becomes infinite. However, the harmonic series 



00 

V 

/ 

L^mJ 



i 

- diverges even though the nth term tends to zero as n becomes 



n-i 

oo 

Zl 
- diverges is seen by writing 
Yt 
n-1 



---++--- = i + i + (i + i) 
a + 1 + 1 + *) + (i + + A) + 



We now consider some practical tests to determine whether a series of 
positive terms converges or not.. 



1. Since u n ^ 0, we note that the sequence s n u } , n = 1. 2, 

j-i 
3, ... , is monotonic riondecreasing. If a constant, M, exists such that 
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s n = u 3 < M for all n, the sequence is bounded and hence converges. 



V i 

Thus the series ) -^ converges since 

= 1 

n 

,. = ;p <! + >+;!-,-... =2 

for all n. 

00 00 

2. If v n ^ u n ^ and N v n converges, then N w n converges. This is 

w = 1 n = 1 

the comparison test of Weierstrass. The proof is trivial and is left as an 
exercise for the reader. Conversely, if u n ^ v n ^ 0, and if \ v n diverges, 

OO 00 

then / u n diverges. The series / < a < 1, diverges, since \/n a > l/n 

n=l n = 1 

for n = 1 , 2, . . 

3. The Integral Test. Let &(x) ^ be a monotonic nonincreasing 
function defined for x 2> 1 . Let the reader show that 



^ ^(2) + ^(3) + - - + ^>(w) (10.91) 
From (10.91) the reader can deduce that } <p(n) converges or diverges 

l_j 

W = l 

according to whether / (p(x) dx exists or not. 

00 

V i . i 

The series ) , a > 1, is seen to converge since (p(x) = is mono- 

H=l 

tonic decreasing and lirn / - = - - for a > 1 . 
^oo Jj x a a - \ 

4. D'Alembert's Ratio Test. Suppose that a constant d exists such that 

00 

f or n ^ A r we have a n+ i/a n < d. If d < 1, ^ a n converges. From a AT +1 

nl 

a^, aAT +2 < da N +\ < d 2 a N , . . . , a^ +p < d p aAr, ... we note that 

00 00 

a n < ajv y rf r = a^/(l d). Thus y a n is bounded and so con- 



r = 
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00 

verges. If a n +i/a n > d ^ 1 for n ^ N, the reader can show that ^ a 



n-l 



diverges. At times it may be useful to consider 

lim ?=! = d (10.92) 

TJ oo ^n 

If d < 1, then a n +i/a n < d' < 1 for n ^ JV. Why? Thus the series 
converges. If d > 1, the series can be shown to diverge. The case 
d = 1 is indeterminate since examples can be found for which both con- 
vergence and divergence exist. 

5. The Cauchy nth-root Test. Assume that lim \/a n = fc < 1. Then 

n * 

f or n ^ N we have \/a n < fc' < 1, so that a n < (fc') n for n ^ N. Since 

00 00 

) (fc') w converges, of necessity ) a n converges. The reader should 

L< L* 

n-V n-l 

00 00 

show that, if lim \fa n = fc > 1, then / a n diverges. The series / ^ 

rl t QQ ^^J L.^J l 

__ T? ,:TI 

converges since ^/o^ ~ 9 ^ ^ 

00 

6. Raabe's Test. We consider the series^ ^ a n , a n > 0. If 

n-l 

(10.93) 



with lim n/(n) = 0, then y a n converges or diverges according as o- > 1 

n- +4 

n 1 

or or ^ 1. From (10.93) we have 



\n~ n - (n + 1) = er - 
[ a n+ i J 



lim n ~ n - (n + 1) 
Assume a > 1 so that <r ~ 1 = fc > 0. For n ^ N we have 

n fjL _ ( n + 1) > I 

or ^ [na n - (n + I)a n4 -i] > n+i (10.94) 

Let n - AT, ]\T + 1, AT + 2, . . . , N + p - 1, in (10.94), and add. 
We obtain 
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N+p-l 

^ a n+1 < ? (Na N - (N + p)a N + p ] < \ Na N 



k 



N + p-l 



Since (2/k)NdN is a fixed number, the series V a n+ i is bounded for 

n?N 

00 

all p. Hence ^ a n is bounded and so converges. If a < 1, then for 



n ^ N we have a n +i > -,- , o n . Thus 

n + 1 

^ N ^ N + 1 ^ 

aAT ' aN+2 "~ aAr4 " x 



^ N 
> - 



so that 



00 

and y a n diverges. It can also be shown that N ar, diverges if a = 1 . 



n-l 



V i 

As an example we consider > ^ We have 



a W 4-i n 2 n 2 n n 2 

Since lim n ^ = and o- = 2 > 1 , the series converges. The series 

" 



oo 

V i 

y - di 



- diverges since (7 = 1. 

Of particular interest are those series with alternating positive and 
negative terms. We consider 



(10.95) 

00 

In order that N ( l) n+1 a n converge, of necessity, a n > as n oo. 
Let us assume further that a n+ i g a n for w = 1, 2, . . . . We show that 
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the series given by (10.95) converges. First let us note that 

S*n = (i 02) + (a 3 a 4 ) + + (a 2w ~i a 2n ) ^ 

= ai (a 2 as) (cu as) (&2n-2 a 2n -i) ~" a 2n ^ #1 

Hence the sequence 82, 8*, . . . , $ 2n , ... is monotonic nondecreas- 
ing and bounded, and thus converges to a unique limit S. Moreover 

$2n+i = Sz n + ct2n+i, so that lim $ 2n +i = S since lim a 2n -fi = by 

n oo n oo 

assumption. Thus the alternating series converges to a unique limit 
8 0. 

00 00 

If / (~ l) M+1 a n converges but y a n diverges, we say that the series is 

n - 1 n = 1 

oo 

conditionally convergent. If ^ a n converges, we say that the alternating 

n~l 

series is absolutely convergent. 

Problems 



1. Apply the integral test to the sei les y j-r. TT-> and show that the 



series con- 



verges or diverges according as a > 1 or a ^ 1 . 

2. Show that the series 1 ^-f^ 7+- --f ( l) n+1 - -f converges. 

Z 3 4 n 

00 

3. Show that a necessary and sufficient condition for the convergence of y a n is 

n = l 

that for any e > an integer N exists such that a n +i -f a n + 2 H~ + flm+i| < c for 
m ^ n ^ ^ 

4. Show that, for |r| < 1, lim r = 0. //m: \r\ n + l = |r| |r| n < |r n 

W oo 

__ 00 

6. Show that lim \l -7 = so that > . converges. 
n-4oo *n\ L4 n\ 

n = l 

6. Show that the series 

i , ^ , ( + 1)0(0 + 1) , a( + !)( + 2)/8tf + DQ8 + 2) 
"*" T "7(7 + 1)2' "^ T"(T -f 1)(7 + 2)3! -r ' ' ' 

converges if 7 > a -f- and diverges otherwise. 

00 

7. Show that the series > __ diverges. 

w n \/n 

n**l 

Q n . , ,, . V |"(2^ - l)!!4n +31 2 , 

8. Consider the convergence of > ... = ^ with 

w L (^fiW;!! &n-\- L\ 
nl 

(2n)!! - 2-4-6 - (2n) 
(2n - 1)!! - 1-3-5 - (2n - 1). 
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00 

9. Assume a n ^ a n +i > for n = 1, 2, 3, . . . , and further assume that y a n 

n-l 

oo 

converges. Show that lim na n = 0. Why does / - diverge? 

n-+> ^ n 

n-l 
oo oo n n 

10. Let y s n converge to S, y t n converge to T. Define <r n = ( / *) ( / ^)> 

n = 1 n = 1 1 .; 1 

and show that hm cr n ST. 

00 

V ( l) n+1 

11. Show that the series / is conditionally convergent. Is it possible 

Lj ^ 

n=l 

to rearrange the terms of this sequence so that the resulting sequence would converge 
to TT or any other number? 

12. Consider the sequence k\, k^ . . . , k u , . . . , defined by k l+ i 2 \///l -f k lt 
< ki < 1, ? = 1, 2, . . . . Show that k l+1 > k> and < fc, < 1 for % = 1, 2, . . . . 
Then prove that hm fc = 1. 

l-^ oo 

13. Consider the sequence of complex numbers, a n + b n i, n = 1, 2, 3, .... If 
the sequence ja n | converges to a and the sequence {b n \ converges to 6, show that the 
sequence \a n -h ^n*J converges to a + 6i in the sense that for any > an integer 
N(e) exists such that 

| (a + bi) - (on -h Ml < c forn ^ N 

14. Show that the convergence of a sequence can be made to depend on the con- 
vergence of a series. Hint: 

S n = S] + (S 2 S)) -h (3 S Z ) + ' * ' + ( - S n _i) 

10.21. Sequences and Series with Variable Terms. Let us consider a 
sequence of functions 

/i(x),/,(x), . . . ,f n (x), . . . (10.96) 

with / n (x), n = 1, 2, . . . , defined on the range a ^ x ^ 6. For any 
number x = c of the interval (a, b) we can investigate the convergence 
of the sequence of constant terms 

/i(c),/ s (c), . . . ,/(c), . . . (10.97) 

If the sequence of (10.97) converges, we can write 

lim f n (c) = A c (10.98) 

n 

where A c is a constant which obviously depends on the number x = c. 
If the sequence of (10.96) converges for all x on the interval (a, 6), we 
obtain a set of numbers [A x \ which defines a function of x on (a, ft), 
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for to each x there corresponds a unique A x . We write 

lim f n (x) = f(x) a ^ x b (10.99) 

n * 

= A x . 



Example 10.30. We consider the sequence |/ n (/)l with f n (x) = x/(\ + nx), 
n - 1, 2, . . . , x ^ 1. At x - we have /,,(()) = 0, n = 1, 2, 3, . . . , so that 

hm / n (0) 0. For x ^ it is obvious that lim / n (.r) == lim r-^ = 0. The 

n-> oo n > * n-> oo * ' nx 

limiting function for this example is/(jr) = 0, ^ x ^ 1. If we consider the sequence 
1 /()! with/nCr) - 1/(1 + wJ 1 ), w = 1, 2, 3, . , JT ^ I, we note that 

hm /(()) = hm 1 = 1 

H > oo 71 > oo 

whereas, for x j& 0, Inn f n (x) = 0. The limiting function in this latter case is 

n * 

defined by /(x) = OforO < x ^ 1 ; /(0) = 1 This function is discontinuous at x 
Let the reader graph a few terms of this sequence. 

Example 10.31. Let us see how rapidly the sequence \x/(\ -f nx) } converges to 
f(x) as 0, : x ^ 1. At x the convergence occurs immediately since f fl (0) = 
for M 1, 2, 3, .... For x ^ we have 

!/(*) - /(x)| = l ~ < * do.ioo) 

c > and arbitrary, provided (I + nx)/x < 1/cor n > 1 A I/JT Thus for n. > 1/c 
(10.100) will hold since l/e > 1/e 1 /x If A r is the first integer greater than 1/e, we 
shall have \f n (x) f(x)\ < * for n ^ N It is important to note that an integer JV 
can be found which is independent of x for ^ j- ^ 1 We say that the sequence 
converges uniformly to its limiting value f(x) s 

DEFINITION 10.27. The sequence {f n (x)\ defined for a ^ x ^ b is said 
to converge uniformly tof(x) if for any e > an integer N exists such that 

\f n (x) -f(x)\ < t (10.101) 

holds for all x on (a, b) provided n ^ N . 

Uniform convergence is essentially the following: If one imagines that 
a circular tube of arbitrary radius e > surrounds f(x) throughout the 
length of the definition of f(x), then uniform convergence guarantees that 
an integer N can be found such that, for n ^ N t f n (x) will also lie inside 
the tube for all x on (a, b). In other words, all terms of the sequence 
from the Nth term onward lie inside the e tube throughout the length of 
the tube. The value of N, in general, depends on c. Since the curves 
/ n (x), n = 1, 2, 3, . . . , are two-dimensional, the tube need only be 
two-dimensional. It is not necessary that /(x), and hence the tube, be 
continuous. 



Example 10.32. Let us consider the sequence jl x n \, n = 1, 2, 3, . . . , defined 

|. 



for - x -J. From Prob. 4, Sec. 10.20, we have that lim x n - if \x\ 
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Thus J(x) - lim f n (x) - lim (1 - x n ) - 1 for - ^ . We wish to deter- 

n > n oo 

mine whether the convergence is uniform. We have |/ n (o?) f(x)\ = |x| n < (|-) n , 
since |z| ^ ^ is our range of definition. Hence we can make \f n (x) f(x)\ < if we 
choose n sufficiently large so that ( ) n < e. This can be done if we choose n > 
In /(ln 2 In 3) for < < 1. If e ^ 1, we can choose n ^ 1 Thus the sequence 
[fn(x)\ converges uniformly to f(x) on the range J- ^ x ^ ^-. 

Example 10.33. We consider the sequence \nxe~ nT \ denned for ^ x ^ I, Let 
the reader show that 

f(x) = lim nxe~ n * =0 ^ x ^ 1 

n * 

We show, however, that the sequence does not converge uniformly to f(x). First we 
note that 

\f n (x) -/(*)! =nxe~* ^x ^l 

If the convergence were uniform, then for e = 0.01 we would be able to find an integer 
N such that nxe~ nx < 0.01 for n ^ N arid for all x on ^ a: ^ 1. In particular 
Nxe~ Nx would be less than 0.01 for all x on (0, 1). If we choose x = l/N, we have 
N (l/N)e~ N ' (l/N) e~ l > 0.01, a contradiction. Hence the sequence fails to con- 
verge uniformly to f(x) on the range ^ x ^ 1 . 

The following theorems will emphasize the importance of uniform 
convergence : 

THEOREM 10.29. Let (f n (x)} be a sequence of continuous functions 
defined for a ^ x ^ 6. If the sequence converges uniformly to f(x) on 
(a, b), then/(#) is continuous on (a, 6). 

The proof is simple. Let x = c be any point of (a, 6), and choose any 
6 > 0. From 

f(c + ft) - /(c) = [/(c + h) - f n (c + h)] 

+ [U(C + h) - f n ( C )] + (f n (c) ^ /(C)] 

we have 



+ |/.(c + A) - /.(c)| + |/ m ( c ) - /(c)| (10.102) 

From uniform convergence we note that an integer N exists such that 
\fn(x) f(x) \ < e/3 forn ^ N and for all x on (a, b). Applying this result 
to (10.102) yields 

|/(c + h) ~ /(c)| < | + |/^(c + h) - /*(c)| 

Since /N(X) is continuous at x = c, we have \/N(C + h) /AT(C)| < /3 for 
|A| < 6, 5 > 0. Hence for any e > a 6 > exists such that |/(c + h) 
/(c)| < 6 for |&| < 8. Q.E.D. 

Example 10.33 shows that f(x) can be continuous even though the con- 
vergence is not uniform. Uniform convergence is a sufficient condition 
for continuity to occur, but is not necessary. 
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THEOREM 10.30. Let \f n (x)} be a sequence of continuous functions 
defined for a ^ x ^ 6. If the sequence converges uniformly to/(x), then 

f b f(x) dx = f b lim f n (x) dx = lim f b f n (x) dx (10.103) 

Ja Ja _> __>, Ja 

The proof of (10.103) proceeds as follows: Define R n (x) by 

= /(*) + R*(x) (10.104) 



Since /(x) is continuous from the previous theorem, R n (x) is also continu- 
ous, for it is the difference of two continuous functions. From (10.104) 
we have 



f b f(x) dx = l b f n (x) dx + f b R n (x) dx 

Ja Ja Ja 

or \[ b f(x) dx - f b f n (x) dx = l b R n (x) dx 

I Ja Ja Ja \ 



From uniform convergence we note that for any e > 0, and hence for 
t/(b a) > 0, an integer N exists such that \R n (x}\ < t/(b a) for 
n ^ N and for all x on (a, b). Thus 

f b f(x) dx - f b f n (x) dx r \R n (x)\ dx < e 

Ja Ja Ja 

for n ^ N. This means that the sequence / f n (x) dx (Converges to 

Ja 

f f(x) dx, so that (10.103) results. Q.E.D. It should be emphasized 
that Theorem 10.30 was shown to be true provided a and b are finite. 
Example 10.34. We consider the sequence of Example 10.31 We have 

i f 1 x 7 r 1 f 1 /\ l \ i 

Inn / -.--. - dx = hm - / ( 1 .-- - I dx 

_ JO 1+nx n^^njo V 1+nx/ 



/I 
f(x) dx = 0, which checks the results of Theorem 10.30 

since the convergence was seen to be uniform. 

Example 10.35. Let the reader show that the sequence \nxe~ nx *\ does not converge 
uniformly on the range s x ^ 1 but does converge to f(x) 2= 0. Now 

f l nxe~ nx * dx -^" n ' f * = |(1 - -) 
so that lim f nxe~ nx * dx = ^ ^ f f(x) dx = 

n > oo '^ '^ 

On the other hand, the sequence \nx/(\ -\- n 2 x 2 ) j does not converge uniformly on the 
range ^ x ^ 1 but does converge to f(x) s (), and 
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Uniform convergence is a sufficient condition to apply (10.103), but it is not a neces- 
sary condition. 

THEOREM 10.31. Let {/(#)} be a sequence defined over a ^ x ^ b 
which is known to converge to the constant /(c) at x = c, a ^ c ^ b. 
Assume further that the sequence {/(#)} converges uniformly to g(x) 
on (a, b) with/^(x) continuous on (a, 6) for n = 1, 2, .... We show 
that the sequence \f n (x) } converges uniformly to a function f(x] such 
that /'Or) = g(x) for a ^ x ^ 6, that is, 



lim f' n (x) = f'(x) = - ( lim f n (x)] (10.105) 

n > oo (*X n oo 

Since the sequence \f' n (x)} converges uniformly to g(x) we have from 
Theorem 10.30 that 



g(x)dx = lim f' n (x)dx = lim [f n (x) - f n (c)} (10.106) 

n -+ x Jc n __>* 

From (10.106) the reader can readily deduce that 
lim f n (x) = f'g(x)dx+f(e) 

n->oo Jc 

Thus lim / n (^) = /(x) exists, and since g(x) is continuous, we have 






f(x) = g(x). From /'() = /;() + R n (x) with |B n (a?)| < e for n ^ ^V 

let the reader show that the sequence \f n (x) } converges tof(x) uniformly. 

If / n (x), n 1, 2, 3, . . . is defined for a ^ x ^ J>, we say that the 

oo oo 

series N / n (x) converges at a: = c if the series of constant terms Y / n (c) 

n-l 

00 

converges, a ^ c ^ b. We write /(c) = Y /n(c), where 



n-l 



n-l 



/(c) = lim / 4 (c) 
^- ffi 

00 

If ^ /n(^) converges for all x on (a, 6), we write 

n=l 

oo n 

f(x) = Y /(*) = lim V MX) (10.107) 

^. n > ,^, 

n 1 fc=l 

It is convenient to express f(x) as a finite series plus a remainder which 
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obviously contains an infinite number of terms, 



R *W (10.108) 



with R n (x) = f k (x). 



DEFINITION 10.28. The series ^ f n (x) is said to converge uniformly to 

f(x) on (a, b) if for any e > an integer N exists such that 
\R n (x)\ < e for n N, a .r b 

In terms of the Cauchy criterion we note that uniform convergence exists 
if for any c > an integer N exists such that 

\S n+p (x) - 8 n (x)\ < (10.109) 

n 

for n ^ N, all p > 0, and for all x on (a, 6), with s n (x) Y fk(x). 

The results concerning continuity, integration, and differentiation of 
uniformly convergent sequences apply equally well to any uniformly con- 
vergent series. To prove these results, one need only change the series 
into a sequence [see (10.88)]. 

Before constructing a few tests for determining the uniform conver- 
gence of a series we prove a result due to Abel. Let Ui, Uz, . . . , u n 
be a monotonic nonincreasing sequence of positive terms, u 3 +\ ^ u 3 , 
Uj > 0. Let ai, a 2 , . . . , a n be any set of numbers, with M and m the 
maximum and minimum, respectively, of the set 

n 
&1, Oi\ -f- &2, #1 H- C&2 4" ^3) ) / tt < 

We show that 

n 

mu, ^ Y OM g Mui (10.110) 

n 

Proof. Let si = ai, 2 = i + 02, . . . , s n = N a t -, so that ai = si, 
a 2 = 2 ~~ Si, a 3 = 6*3 $2, , a w = s n $ n -i. We write 

n n 

Z^r\ 
djUj = / (Sj Sji)U 3 So == 

/w 

-f ' + (S n ~ n-l)Wn 
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Since u 3+ i ^ u } for j = 1, 2, . . . , w 1, and since w n > 0, we note that 



+ 



71 

V > 

L, 3 3 = 



w 2 ) + M(^ 2 MS) + 

u 2 ) + m(u z u s ) + 



which proves (10.110). 

A. AbeUs Test for Uniform Convergence. y a n w n (x) converges uni- 

r/-l 

formly on (a, 6) if 

00 

] . N a n converges. 

n=l 

2. w n (x) > and u n (x) ^ u n +i(x), a g x ^ fe, n ^ 1. 

3. |wi(x)| < A; for all x on (a, &). 



Proof. Since N a rt converges, an integer AT exists such that 
< c//c for all p > 0. From (10.110), Abel's lemma, we have 

N + p 



n-AT+1 



x) ^ ~ Ui(x) 



a n u n (x) 



for all p > 0. Q.E.D. 

B. The Weierstrass M Test. Assume |w n (^)| ^ M n = constant, for 

00 00 

n = 1, 2, 3, . . . , a ^ x ^ 6. If N Af n converges, then Y w n (x) 

n \ n 1 

converges uniformly on (a, 6). Proof. For any e > an integer N 
exists such that 



M n < for all p > 



But 



Q.E.D. 



C. ^ u n (^)' ; n(^) is uniformly convergent on (a, b) if: 



n-l 
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n 

1. / u t (x) < M = constant, for all n and for all x on (a, 6). 

'A 

ao 

2. ^ | y n + i(#) v n (x)\ is uniformly convergent on (a, 6). 

n-l 

3. v n (x) > uniformly as n * QO . 

n 

Proof. Let s n (z) = } u l (x) \s n (x)\ < M from (1) 



Now 



t-1 



= (Sn+i Sn)V n +l 

= SnVn+1 + Sn+l 

n+p-1 

- S n \ ^ | w | |i' n +l| + 



kn 



' ' * + S n+p V n+p 



n + p-1 



V r ~ Vr+i\ 



Since y w (a;) uniformly as n > oo ? we have |v w+ i| < e/3Af, |^ n +p| < 

n + p-l 

e/3Mforn ^ A^I. From (2), V 1^ ~ ^il < e/3Mforn ^ ^ 2 . For 

r = n-f 1 

A^ equal to the larger of NI, N 2 we have 

\S n + P - S n \ < e n^ N, all p > Q.E.D. 
Example 10.36. We consider 



for < 8 ^ x ^ R. We have <p(j 

(1) a, = 
(2) K +I -,!- | ( + 1)- 



} a n v n (x) wi- 



with 



< 2 



00 

Y 



/nH 
Jn 

00 

Zv 
n ( +1) ~ R Z 



x > 



n-1 
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00 00 

Since \ n ~^ 8+l) converges, the Weierstrass M test states that ^ \v n+ i v n \ con- 

n-l n-l 

verges uniformly for < 5 ^ z ^ #. 

(3) n~ x ^ n~ 5 -> so that n~* -> uniformly 
From (C), <?(x) converges uniformly for < 5 5* x ^ /2. 
An important type of series is the power series 



P(x) = > a n x n (10.111) 

L-t 
n-O 

If we apply the ratio test, we note that the series of (10.111) converges if 

lim 



a n x n 



or \x\ < lim 



(10.112) 



We call R the radius of convergence of the series. The word "radius" 

00 

comes into play if we consider the complex series P(z) = Y a n z n , 

n-O 

z = x + iy, which converges for \z < R. If P(x) converges for \x\ < /^, 



00 

it follows that the series Q(x) = } \a n \x n also converges for \x\ < R 

n = 

[see (10.112)]. The ratio test fails to tell us anything concerning the end 

oo oo 

points x = R, x = R. The convergence of Y a n R n and Y a n ( R) n 

n0 n=0 

must be examined by the methods of Sec. 10.20 or by other means. 

00 00 

Example 10.37. (1) y n\x n converges only for x = 0. (2) y x n converges 

n=0 n=0 

V x n 
for 1 < x < 1 . (3) } converges for all finite x. 



If P(x) = a n x n converges for \x\ < R, it is very easy to show that 

n = 

a n x n converges uniformly for x\ ^ R\ < fi. We have for \x\ ^ /?i 



n-O 
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00 

that \a n x n \ ^ kn|/Zj, n = 1, 2, 3, . . . , and since } \ a *\R* converges, 

n-O 

the Weierstrass M test yields uniform convergence. Since each term of 
the series is continuous, P(x) is continuous inside its region of conver- 
gence. Let the reader show that P'(x) has the same radius of conver- 

00 

gence as P(x). Why is it that P'(x) = Y na n x n - lt ! 

n-l 

A result due to Abel concerning power series can be stated as follows : 

00 00 

If Y a n converges to s, then Y a n x n is uniformly convergent for 

n-O n-O 

00 

^ x g 1, and lim Y a n x n = 8. The proof depends on the result 

*-! W 
n 

00 

given by (10.110). Since N a n converges, we know that for any c > 

n-O 

an integer N exists such that |a n + a n +i + + a n +p| < e f or n ^ AT, 
all p ^ 0. For ^ a; ^ 1 we know that x w , n = 0, 1, 2, 3, . . . , is a 
monotonic nonincr easing sequence, so that 



^ ex" ^ c 



[see (10.110)]. Hence the series is uniformly convergent for g x ^ 1 

oo 

sothatPOr) = Y a n x n is continuous on the interval ^ x ^ 1. Hence 

n-O 

oo 

lim P(x) = P(l) = Y a n = s. For example, it is known that 

X >1 ~* 

n 

to (i+*).V(-i)^ 



n-l 



V^ (_]_)n-fl V"V /_^]n-fl 

for \x\ < 1. Since > - - - converges, we have In 2 = ) - - - --- 

n-l n-l 

00 

The converse of Abel's theorem is not true. If P(x) = Y a n x n con- 

n^O 

00 

verges to s as x -* 1, we cannot say that Y a n converges to s. An 



n-l 
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00 

example due to Tauber is as follows: P(x) = ^ ( l) n x n = 1/(1 + x) 9 



00 

and lim P(x) = . However, } ( l) n is not convergent. 

*->! ^ 



w-0 

Problems 



oo 

1. Show that 1/(1 -M 2 ) = /^ (-l) n < 2w converges uniformly for t s < 1. 



n-O 

Integrate, and show that 



T 2n-fl 

-^5rn 

n0 



,. 4 y (-1)-^ 

^ 2n 4- 1 

00 

2. If lim \/ch, T exists, show that P(z] = y a n (z zo) n converges for 

n w ^rt 



|z -2c| 



3. The series ^ ?V N (x) is said to be absolutely convergent for a ^ z ^ b if y 

n=0 nO 

ao 

Z# 2 
n . $ is absolutely convergent 
(1 ~t~ x ) 
n-1 
for | a: | ^ 1 and that S(x) is not uniformly convergent for \x\ ^ 1. 

00 

4. Show that ) Sm n . , is uniformly convergent for 7r/4 < x ^ 3^/4. 

Z^ zn H- 1 

By considering x = ir/2 show that the series is not absolutely convergent for T/4 <j 
x ^ 3ir/4. 

5. Show that 

/" i In x , V T 1 i j V 1 T Z 

/ = rfx = ) / X n In a? dx ) 7 r rr^ = 

Jo I - x Lt Jo Li (n + I) 2 6 

n0 n-O 

00 

V^ iT^ 

6. Prove that the series > rr- z-i is uniformly convergent for all x. 

^ I ~r w x 



. 
o I x x 4 



7. Prove that 

v^ 

8. Let S(x) > w n (a;) converge uniformly f or a x &, and assume further 

n-l 



456 ELEMENTS OP PUEE AND APPLIED MATHEMATICS 

eo 

that lim u n (x) L n for all n, u n (x) not necessarily continuous. If / L n con- 

a->e ^-* 



n-1 



OB 

verges, show that lim S(x) = y L n . 

X >fi ^ 



n-1 

9. Let a mn ^ for m = 1, 2, 3, . . . , n 1, 2, 3, . . . , and suppose a constant 
K exists such that 

N M 



00 00 

for all M, N. Show that ^ } a mn exists and that 



*mn (10.113) 

n = lm 1 T/I = 1 n 1 

Write the elements {a mn l as a square array, and interpret (10.113). 

00 

10. If P(z) - y a n z n and 

n-O 

P(z + w) - 



a =0 r-0 



converges absolutely, that is, ^ ^ |a n | T r j |?i;| r |0| n ~ r converges, show that 



n-0 r = 

00 00 

P(z + w) - a s+r z'^ (10.114) 



Apply (10.114) to E(z) - and show that J57( + w;) - E(z)E(w). 

L^ fi 

n=0 

00 

11. A series ^ w n (^) is said to be "boundedly convergent" for the interval 
nl 

00 

a ^ x ^ 6 if y u n (x) converges for all x on (a, b) and if a constant M exists such that 



< M for all x on (a, 6). The series is said to be uniformly continuous on 

n^l 

(a, 6) except for the point c if the series converges uniformly on the intervals a x 
c 8, c + 8 ^ x ^6, however small 5 may be, 8 > 0. Show that if a series is 
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uniformly convergent on a ^ x ^ b except for a finite number of points and if the 
series is also boundedly convergent then the series may be integrated term by teflm. 
12. For what ranges of x do the following series converge uniformly? 



V * Y J_ 

L, x* + n 2 Lf x*n* 



n-l n-1 

13. Second Law of the Mean for Integrals (Bonnet). Consider 

n-l 



/ f(x)<f>(x) dx } 



y-o 

Let <p(x) be positive and mono tonic nonincr easing on (a, 6), and consider the set of 
numbers 

S Q =f(a)(xi - a) 
Si - f(a)(xi - a) 



,=0 

Let A and B be the minimum and maximum values of the set S t , i = 0, 1, 2, . . . , n, 
apply Abel's result of (10.110), and show that 



dx 

Ja 

If f(x) is continuous, show that 

dx - ^(a) P/W ^ a ^ ^ ^ 6 

Ja 

Consider ^>(x) <p(b) 3* 0, ^(x) monotonic nondecreasing and positive, and show that 

dx = v(a) P/(*) dx + ^(6) [ 6 /W dx (10.115) 



f b f(x)<f>(x) 

jo 



10.22. Dini's Conditions. A simple example shows that (10.103) does 
not necessarily hold if one of the limits of integration is infinite. Let us 
consider the sequence j/ n (x) } with f n (x) (2x/n*)e~** /n *, n = 1, 2, 3, 
. . . ,withx ^0. It is easy to see that lim f n (x) = f(x) s Oforx ^ 0. 

n > 

By setting f' n (x) = one easily shows that the maximum value of f n (x) 
for x ^ occurs at x = n/\/2 so that |/ n (a?)| ^ V^A (l/^) < 1/w < c 
f or n ^ N = 1 + [1/e]. Hence the sequence converges uniformly to its 
limit f(x) = for x ^ 0. On the other hand, 

r / o r 

lim / / n (x)dx = lim / ^e-* !/ "'<fe = lim 1 = 1 

n-*w JO n JO ^ n > 

* I lim /(*) do; = / " dx = 
yo - yo 
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We now state and prove a result due to Dini. We can write 



(10.116) 

n^\ n^ I 

if the following conditions are fulfilled : 

1. u n (x) is continuous for # 2> a, n = 1, 2, 3, . . . . 

2. lim v n (x) = y n () exists for n = 1, 2, 3, . . . , with 



u n (t) dt 

3. Y u n (x) converges uniformly for a ^ x ^ -Sf , X arbitrary but finite. 

n-l 

00 

4. 2, v n(x) converges uniformly for x * a. 

n-l 

Proof. From (3) and (2) we can write 

oo eo oo 

/X V V* 

\ /)/ f /y | /7 / V* \ 

/ t*nv"k/ ^**k """ / 
I L-4 Lf 

/V / \ ^ r T X V /\^_r V 
y Wnv*^) dx = lim I y ^n\x) dx lim y 

00 00 

To prove- (10.116), we must show that lim Y y n (X) = Y y n (), since 

X ~~ >8 n-l n-l 

y yn() = y \ u n (x)dx. It is first necessary to show that y y n () 

Lj Lj Jo. Li 

n-l n-l n-l 

exists by making use of (2) and (4). From (4) an integer N can be found 
such that 

oo N 



nl 

Hence 



C ^ for all X 

4 ^ 

n-l n-l 

AT 

From (2), lim v n (X) = y n (oo)forw = 1,2,3, ... ,tf,sothat ^ K(X) 

tfn()] < c/4 for Z ^ Zo, since the limit of a finite sum is the sum of 
the limits. 
The identity 

N N N 



n-l n-l n-l n-l n-l 
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shows that for any > we can find an integer N and then an XQ such 
that 

N 

^n(oo) <| (10.117) 

n-l 

for X ^ XQ. By the same reasoning we have 

N + P 



V i; n (X) - V . 

n-l n-l 



for X *z Xi and p a positive fixed integer. Choosing any X larger than 
both XQ and Xi yields 



n-l 



< 6 



(10.118) 



Now (10.118) holds independent of any X since no X appears in (10.118), 
so that for any c > an integer N exists such that (10.118) holds for 
all p > 0. This is exactly the Cauchy criterion for convergence, so that 

00 

/ ^n() exists. Hence for 6/2 > an integer NI exists such that 



n-l 



~Z" B 

n-l 



(10.119) 



Choosing the larger N of (10.117) and (10.119) and combining (10.117) 
with (10.119) yields 

00 00 

(10.120) 



n-l n-l 

for X ^ XQ. Formula (10.120) is just the statement that 



lim 



Q.E.D. 



Example 10.38. We consider the Bessel function 



n-0 



and show that / e"*Jo(x) dx I/ A/2. The nth term of the series c""/o(a;) is 
yo 
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, 
nlnl 



u n (x) e~* , / > and we see that u n (x) is continuous for x ^ 0. Also 



and ,(-)- lim *(*) 



3_no ~ """ j\j 2 2n n!n! 



Let the reader show that lim v n ( ) - 0. By the ratio test it is easy to determine 

n K*> 

that e~*J r o(a;) converges for |z| ^ X, X arbitrary. Finally it remains to show that 

00 

y v n (x) converges uniformly for x ^ 0. If we write 



n-0 



n0 



n 



we cannot show uniform convergence since the series of constant terms in (10.121) 
diverges. However, through integration by parts the reader can readily verify that 



M /o* 



so that y v n (x) is a series of alternating terms with |t> n +i(:c)| < |t> n (#)| for x > 0, 
y n (0) 0, n BS 0, 1, 2, . . . . For such a series the remainder after n terms of the 

00 

series y v n (x) has the property that 

\Rn(x)\ \Vn+l(x)\ 02n + 2 /; ? nT/J . 1 S"l (10.122) 



Since the right-hand side of (10.122) tends to zero as n becomes infinite, we note that 
v n (x) converges uniformly for x ^ 0. Applying (10.116) yields 



n-0 



/"* V (^D n (2n) 

Jo e J W tf:c - 2, 2 2 n!n! 



n-0 
In the next section (Example 10.40) we shall show that 



n-0 



For x - 1 we have / * e~ x Jo(x) dx - 
yo 
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Problems 



go 00 

1. Let S(x) = y , * > a; ^ 0. Show that f S(x) dx - i y 1- 
Li (x 4- n) 3 JO 2 Li n* 



n-l n-l 

2. We wish to determine f(x) such that 

1 s. 



oo 

Assume f(x) = y a n :c n , integrate formally, make use of (10.123), and show that 

f(x) = JQ(X). Then justify your work. 

3. Consider f n (x) = n 2 xe~ nx , n = 1, 2, 3, . . . , show that lim I / n (x) dx ^ 

n > ^ 
/ oo 

/ lim / n (#) rfa;, and determine which one of Dini's conditions is not fulfilled, 
jo M 

4. Consider f n (x) = nx/(l + n*x 2 ), n == 1, 2, 3, . . . , and show that 



f 

= \ li 

JO n 



lim / / n (z) dx = \ lim / n (x) da; 



Do Dini's conditions hold for this case? 



10.23. Taylor Series. Let f(x) be a function denned on c ^ x ^ d 

such that f'(x),f"(x), . . . J M (x) exist on (c, d) with/< n >(:r) fl-integrable. 
We note that, for c^a^x^dorc^x^a^-d, 

/ / (n) (0 * = f (n ~ l) (x) - f (n - l} (a) 

J a 

t* f x 

/ dx / /->(0 dt = /<"- 2 >(z) - / ( "- 2 >(a) - / ( "-"(a)(a; - a) 

yo Ja 

I dx \ 

Ja Ja 



dx 



- /<-(a)(* - a) - 






Continuing this integration process yields 

dx t X dx r "V-HO dt = f(x) - /(a) - f'(a)(x - a) 

Ja Ja 



a 

"' 



V ' 3! 

so that /(re) = V /W( ) (x ^ a)r + 



' (10.124) 

.(*) = r dx [ x dx r - f x f^(t)dt 

Ja Ja Ja Ja 
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If M is any bound of |/ (w) (#)l on the interval (a, x), we have 

\R n (x)\ r\dx\ r\dx\ i * - - r M\dt\ = M \* 7 q| * (10.125) 

Ja Ja Ja Ja "> 

Another form of R n (x) can be obtained if we note that F(x) defined by 

1 (-* 

(n - 1)! Ja 
yields 

p>(x) = / fM(t)(x - 1)"^ dt F'(a) = 

/* v*^/ ==: 7 Q\"f / ^ ^ ' ^*^ ' \^/ == ^ 

(^71 oj ! J a 



by applying the results of Prob. 4, Sec. 10.17. Integrating 

FW(x) = /""(a;) 
times over the range (a, x) yields F(x) = R n (x), so that 



(10.126) 



The inequality of (10.125) results immediately from (10.126). 

If it is known that f(x) has derivatives of all orders and if it can be 
shown that lim R n (x) = 0, then (10.124) yields the important Taylor- 
series expansion of /(#) about x = a, 

00 

/(*) = V/ (r) () ^r^ (10.127) 



It must be emphasized that (10.127) cannot be used as an expression for 
f(x) unless R n (x) > as n * oo (see Example 10.41). A special case of 
(10.127) occurs if a = 0, with 

f(x) = / (r) (0) (10.128) 



Equation (10.128) is the Maclaurin-series expansion of j(x) about the 
origin. 
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Example 10.39. Consider f(x) = e* with a 0. All the derivatives of f(x) exist 
for \x\ ^ X, X arbitrary. For \x\ X we have |/ (n >(z)| - \e*\ e x , so that 



The reader can verify that lim r = 0. A simple proof of this statement occurs if 
n _>flo ni 

V X n 

one considers the series, ) p which is seen to converge by the ratio test Hence 
LJ w! 

n*0 

the nth term, of necessity, must tend to zero. Thus |/? n (x)| as n > for all z, 

00 

and * " ^ ^ since / {r) () - 1 for r - 0, 1, 2, . . . . 

r-0 

Example 10.40. Consider /(a;) (1 -h a?)~i. The reader can verify that 



w 0, 1, 2, 3, . . . . Moreover 



1,00(3)1- M.', 



and for |o;| ^ <r < 1 we have 



Let the reader verify that lim R n (x) = 0. Thus 



n-0 



The series of (10.130) converges for x = 1 and diverges for x 1. The reader can 
verify directly that lim R n (l) 0. 

n > 

Example 10.41. Consider /(z) - g- 1 /*^ ^ ^ 0, /(O) - 0. We have 



To compute / ; '(0), we note that 



x _>0 a: 3.^0 a; 

It can be shown that / (n >(0) - for n - 0, 1, 2, 3, . . . . Hence the Maclaurin 

00 

V x n 

series / / (a) (0) ; converges to zero for all values of x since every term of the series is 

4-4 fl! 

n-0 

zero. Returning to (10.124), we note that f(x) - e"* 1 /* 8 - R n (x) for all n so that 
R n (x) does not tend to zero as n becomes infinite. The Maclaurin series of f(x) does 
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not converge to f(x) for this example. We note that the convergence of a Taylor- 
series expansion of a f unction /(z) does not guarantee that the series converges to /(a;). 
One must always investigate the remainder, R n (x). 

Another form of the Taylor series can be obtained if we replace x by 
a + hj so that 

<r) (a)* (10.131) 



If we allow a to vary by replacing a by #, we obtain 



/(* + A) -/<"(*) *J (10.132) 

r-0 

Equation (10.131) is very useful if we know / (r) (a) for all r and if we 
wish to find an approximate value of f(a + h). If only a finite number 
of terms of the Taylor series are used in approximating /(a + A), an esti- 
mate of the size of the error can be obtained from (10.125). 

Example 10.42. In Example 10.39 we saw that \R n (x)\ ^ e*\x\ n /n\ for the Mac- 
laurin-series expansion of *, \x\ ^ X. If we wish to find an approximate value of e, 

we let x - 1 and note that \R n (l)\ e/n\ < 3/n!. For n = 8 we have |# 8 (1)| < 
7 

3/8! < 0.0001, so that } -^ 2.718 yields a value of e accurate to three places. 

r = 

Actually e - 2.71828 .... 

The Taylor-series extension to a function of more than one variable is 
not very difficult. Consider /(x 1 , x 2 , . . . , x n ) as a function of the n 
variables x 1 , x 2 , . . . , x n . Let h 1 , /i 2 , . . . , h n be any set of constants, 
and define p(f) by 



= f(x l + h l t, x* + h*t, r . . , x n + h n t) 

= /(u 1 , u 2 , . . . , u n ) u* = x i + M i = 1, 2, . . . , n 

(10.133) 

We consider x 1 , x*, . . . , x n as constants temporarily so that p(t) is 
looked upon as a function of the single variable L Let us assume that 
tp(t) has a Maclaurin-series expansion which converges for t = 1. We 
have 



<" n! 

n-O 

Now <b_dj_ du" _ df 

* dt ~ du* ^T~~toT" 
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and, at t = 0, 

-* - 



and ( -^2 ) = tt i h a W. Continuing in this manner yields 






For t = 1 we obtain 



\ . . . , x" + A) = f(x\ x\ . . . , 

- 



-^ ' + dO.134) 



For a function of two variables (10.134) becomes 
f(x + h,y + k)= f(x, y) 



where + * 

r-O 

Problems 

00 

V ( I) n x 2n+1 

1. Show that sin x = > -775 TTTT holds for all values of x. 

Li (*n + 1)1 

n-O 

00 

Y* f _ l)a;2n 

2. Show that cos = ) /0 N . holds for all values of x. 

^ (/n; i 

n = 

00 00 

v^ x 2n+1 V a; 2n 

3. Show that sinh x = ) 75 7-=^-. and cosh x - ) 755-^ hold for all values 

xlv (^Jw -t~ 1;! w v* w / 

n0 n=0 

Of X. 

00 

4. Show that In (1 -f x) - Y C^ 1 )*"" 1 ** holds for -1 < a: g 1. 



466 ELEMENTS OF PURE AND APPLIED MATHEMATICS 



5. Show that In ~-f~ - 2 > -qrj holds for N < 1. If tf - j-^~ > 0,show 

n-O 

that < x < 1. Determine In 5 accurate to four places. 

6. How many terms in the Taylor-series expansion of sin x about x ir/6 are 
needed to determine sin 31 accurate to six places? Evaluate sin 31 accurate to six 
places. 

7. By integrating 1/(1 + x 2 ) find the Maclaurin expansion of tan" 1 x. 

8. Show that 

ao 

10 np- 2 I (2n -f 1)(2* + !)**+* * >0 

n-0 

9. Find the first four terms of the Maclaurin-series expansion of In cos x. For 
what range of x is the series valid? 

10. Find the Taylor series of cos 2 x about x ir/3. 
1L lff (n) (x) is continuous on (a, x), show that 

/&.() - f (n) (a -h 0(s - a)) (x ~ a) " ^ 1 



by making use of (10.126). 

10.24. Extrema of Functions. The Lagrange Method of Multipliers. 
One says that f(x) has an extremum at a point x = c if an 17 > exists 
such that/(c + h) - f(c) has the same sign for \h\ < 17. If /(c + h) ^ /(c) 
for |A| < 17, we say that f(x) has a local maximum at x => c. If 
/(c + A) ^ /(c) for |h| < 17, we say that /(c) has a local minimum at x = c. 
If an extremum of /(x) at x = c exists and if /(#) is differentiable at x = c, 
we have 



(10.136) 
or 



so that /'(c) = 0. Now assume f(c) = f (c) = - - =/<-D( c ) = 0, 
/ (n) (c) 5^ 0, and further assume that / (n) (a;) is continuous in a neighbor- 
hood of x = c. Applying (10.124) and the result of Prob. 11, Sec. 10.23, 
we have 



Since / (n) (c) 5^ and since c > as h 0, we can choose h sufficiently 
small so that / (n) (c) + is one-signed. Hence /(c + h) /(c) will be 
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one-signed provided n is even (h can be positive and negative). Thus a 
necessary and sufficient condition that f(x) have an extremum at x = c 
is that /'(c) = and the first nonvanishing derivative of f(x) at x = c 
be of even order. If / (w) (c) > 0, a minimum occurs, while if / (w) (c) < 0, 
a maximum occurs. Why? 

For a function of two variables we have 



/(* + *, + *)- f(x, y) 



2fxvhk 



If we neglect the higher-order terms (A 8 , A 2 fc, . . .), we note that/(# + ft, 

flf r)f 

y + k) f(x, y) will be one-signed provided ^ = 0, ~ 0, and (f xx h z + 

2fx V hk + / yi ,fc 2 ) does not change sign for arbitrarily small positive and 
negative values of h and k. Let the reader deduce that f(x, y) has an 

ftf r)/ 

extremal at (x, y) provided = 0, - = 0, and 

ux oy 



fxx > or/ yy > 0, a minimum occurs, and if f xx < or/^ < 0, a maxi- 
mum occurs. Why? 

Let us consider the function z = f(x, y) to be extremalized under the 
condition <p(x y y) = constant. If p(x, y) is a closed and bounded curve, 
and if /(x, y) is continuous at every point of the curve <p(x, y) = c, we 
know that there will be a point Po on the curve p(x, y) = c such that 
f(x, y) will take on its maximum value at Po. Remember that we restrict 
P(x, y) to lie on the curve ^(x, y) = c. The same statement applies if 
we consider the minimum value of /(x, y). Now in the elementary calcu- 
lus we would solve <p(x, y) = c for y as a function of x, say, y = ^(#), 
substitute y = $(x) into z = /(x, ?/), to obtain z = f(x, ^(#)), and then 

We would set -=- = 0. We also have 
ax 

dz _ df dfdy df df\-d v /dxl /i 

- ~ 1 



since -^ + -~ ~- = 0. We can obtain (10.138) by another procedure 
due to Lagrange. Consider the new function 

U - f(x, y) + X*(x, y) (10.139) 

We differentiate J7 with respect to x and 2/, assuming that x and ^ are 
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independent variables, while X is considered as a parameter. Setting 

dU A dU A . . , 
_ = ,- = 0yields 

5 +x 5 == (iai4o) 

dy + *dy = 

Eliminating X in (10.140) yields (10.138). This is Lagrange's method of 
multipliers. 

More generally, if y = /(xi, # 2 , . . . , # n ) is to be extremalized subject 
to the conditions ^(#i, x 2 , . . . , #) = c t , i = 1, 2, . . . , m, we form 

m 
U = /(#!, #2, , #n) + / X t ^> t (#i, 2, , # n ) 

and we set 

dU df , 



te . a ~ /.-,- - ; = 1,2, . . . ,rc (10.141) 

The elimination of the X t , i = 1, 2, . . . , m, along with the equations 
^(#i, #2, . . , x n ) = c t , i = 1, 2, . . . , ra, yields the values of xi, #2, 
. . . , x n , which extremalize ?/. 

Example 10.43. We wish to find the ratio of altitude to radius of a cylindrical glass 
(open top) having a maximum volume for a fixed surface area. We have 

V irx 2 y S 2irxy + irx 2 = constant 
From U ira; 2 !/ -f X(27ro;i/ + trx 2 ) we have 

Eliminating X yields y/x = 1. It is easy to show that y/x = 1 yields a maximum 
volume. 

Problems 

1. Consider a cylindrical buoy with two conical ends. Let x be the radius of the 
cylinder, y the altitude of the cylinder, and z the altitude of the cones. Show that 

x:y:z -;r-: 1 : 1 yields a maximum volume for a fixed surface area. 

2. Find the maximum distance from the origin to the curve x 3 -{- y* 3xy = 0. 
8. Derive (10.137) if f xx h* + 2f xv hk + f yy k 2 does not change sign for arbitrary 

values of h and &. 

4. A rectangular box has dimensions x, y t z. Show that x : y:z = 1:1:1 yields a 
minimum surface area for a fixed volume. 
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6. Consider the set of values y Q , y i} y z , . . . , y n and the polynomial 

y(x) - a + aix + a 2 x 2 + + a m x m 
m < n. Show that the set of a*, i = 0, 1, 2, . . . , w, which extremalize 
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w n n 

satisfy } s l+k a l fo with S* = ^ xj, fo = ^ yyx*. 



t-0 



y-o 



10.25. Numerical Methods. Let us assume that we wish to find a root 
of /(x) = 0, and let us further assume 
that /'(x) exists. We can plot a 
rough graph of y = /(x) and attempt 
to read the value of x at which the 
curve crosses the x axis, y = (see 
Fig. 10.4). 

If we choose a point Xi not far re- 
moved from x, f(x) == 0, we note that 
the equation of the tangent line at (xi, 
/to)) is y -/(scO =f'(x 1 )(x-z l ). 
This line, L, intersects the x axis at 
the point x 2 = Zi f(xi)/f'(xi), /'(zi) 5^ 0, obtained by setting 2/ = 0. 
This process can be continued, with 








'V I 



FIG. 10.4 



Xn+1 = X n - ~ 



f'M 







(10.142) 



If the sequence Xi, X2, . . . , x n , . . . can be shown to converge to a 
limit c, then 

f(x ) 
lim x n +i = lim x n - lim y, n/ . 



and c = c f(c)/f'(c) so that /(c) = 0, provided /(x) and /'(x) are con- 
tinuous at x = c, /'(c) 5^ 0. Thus c = x = lim x n . It is apparent that 

n > 

difficulties will occur if /'(x) is small near x = x. 

Example 10.44. For/(x) = x 2 - 2 we have/'(x) = 2x, and (10.142) becomes 

Xn+I . , _ ^ . iL2 (10 . 143) 

From (10.143) we note that < x n +i < x n provided x% > 2, x n > 0. Let the reader 
deduce that xj +1 > 2 if x\ > 2. Thus if we choose xi - 2, (10.143) will yield a 
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mono tonic decreasing sequence bounded below by zero. The sequence must converge 
to a solution of # 2 2 0. The computation may be arranged as follows: 







x 2 + 2 




- ** + 2 


n 


X n 






2* n 


1 


2 


6 


4 


1.7 


2 


1 7 


4.89 


3 4 


44 


3 


1 44 


4 0736 


2 88 


414 


4 


1 414 


3 999396 


2 828 


4142 


5 


1 4142 


3 99996164 


2 8284 


41412 



The desired root to five decimal places is x = 1.41412. 

An iteration process may be used to find a root of x <p(x) = or 
x = <p(x). We shall assume that \<p'(x)\ < A < 1. We define x n +i by 



n = 1, 2, 3, ... 



(10.144) 



with Xi arbitrary. Thus 



with fn lying between x n -i and #, by applying the law of the mean. Thus 

Kfi - x n \ = \x n - a? n -i| |^({)| < \x n ~ x n - t \A (10.145) 

NOW X n = XQ + (Xi X ) + (2 l) + ' ' ' + (# ~ n-i) 

00 

and XQ + Y |x* Xk~~i converges by the ratio test since l-^l - !L < 
k*i \ Xn "" Xn - 1 ' 

.4 < 1, from (10.145). Thus 

n 

lim x n = x + lim Y (x fc x k -i) = ^ 

n w < *W 

exists. From (10.144) and the continuity of <p(x) we have 
lim x n = x = lim 



and f is the desired root of x = <p(x). 
Example 10.45. We find a root of x -J- sin s -f 1, ^(a?) ^- sin x -f 1, 



The computation is arranged as follows: 
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n 


X n 


sin x n 


X n +l " -J Sin X n + 1 


1 








1 


2 


1 


0.8415 


1.4208 


3 


1 4208 


9888 


1.4944 


4 


1 4944 


9971 


1 4986 


5 


1.4986 


9974 


1.4987 


6 


1 4987 


0.9974 


1 4987 



We started with xi = and obtained the desired root, x 1.4987, accurate to four 
decimal places. Had we graphed y = x and y = -J- sin x -h 1, we would have noticed 
that Xi = 1.5 would be a good starting point. 

It is often useful to replace a given function by a polynomial which 
approximates the given function to a high degree of accuracy. Such a 
polynomial is called an interpolating function. Let f(x) be a function 
defined for xo ^ x ^ x n , and let us assume that we have evaluated f(x) 
at x , Xi 9 X2, . . , x n , obtaining the n + 1 values y % = /(x), i = 0, 1, 
2, . . . , 7i. A simple way of constructing a polynomial of degree n, 
L n (x), which satisfies L n (x t ) = y^ i = 0, 1, 2, . . . , n, is due to Lagrange. 
Let us note that the polynomial 



(x - xi)(x - x 2 ) 



(X 



(x - Xi)(x x 2 ) 



- X n ) 



has the value zero at x = Xi, #2, , x n and has the value ?/o at x = XQ. 
A sum of such polynomials yields Lagrange 9 s interpolating polynomial, 



L n (x) = 2/0 



(x - xi)(x - x 2 ) 



(X X n ) 



Xg) 



(x x )(x 



(X - X n -i) 



^ n (X n Xo)(x n Xi) ' ' ' (x n X n _i) 



(10.146) 



One naturally inquires how closely L n (x) approximates /(x) for x ^ x t , 
i = 0, 1, 2, . . . , n. To answer this question, we construct F(x) defined 
By 

F(x) = /(x) - L n (x) - R(x - x )(x - xi) (x - x n ) (10.147) 

with R a constant. Let us note that F(x) vanishes for x = Xo, Xi, x 2 , 
. . . , x n , since /(x) = L(x t ), i = 0, 1, 2, . . . , n. Now let us choose 
any point x of the interval xo ^ x ^ x n other than Xo, Xi, X2, . . . , x n 
and determine R so that F(x) = 0, that is, 

/(*) - L n (55) - R(Z - x )(f - *i) ( - *) - (10.148) 

Up to the present no restriction was placed on /(x). If we assume that 
/(x) is differentiable at every point of x ^ x ^ x n , then F (x) is differ- 
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entiable on this range. If we apply the law of the mean (or Rollers 
theorem) to the intervals (#0, #i), (zi, 2 ), - , (#t, #), (#, x*+i), - - , 
(x n -i, x), we note that F'(z) will vanish at n + 1 points. If we further 
assume that f"(x), /'"(x), . . . , / (w+1) (s) exist on x g x g a? n , we note 
that F (n+1) (x) will vanish at a point x = s y XQ ^ s ^ x n . Since L(x) is 

a polynomial of degree n, we have , n ^ = 0. From (10.147) we 

obtain 

F<+() = = f (n +(s) - #(n + 1)! z ^ * ^ * 

^n+l 

since -r-^+i [(# #o)(# Xi) (x x n )] = (n + 1)1. Substituting 
R = /c-+i)()/(n + 1)! into (10.148) yields 

/(n+D/o 

xO (- x.) (10.149) 



Equation (10.149) holds for any value of x in the interval XQ ^ x ^ x n 
although (10.149) was obtained by choosing x ^ x, i = 0, 1, 2, . . . , n, 
for one notes that/(x t ) = L n (x l ) is consistent with (10.149). 
If M is any upper bound of |/ (n+1) (^)l on XQ ^ x g xi, we have 

L n (x)| ^ \x - x | | - sxl - - |x - x n \ (10.150) 



The inequality (10.150) is useful in finding an upper bound to the differ- 
ence between f(x) and L n (x). 

Example 10.46. The third-degree polynomial passing through the points (0, 5), 
(1, -3), (2, -1), (3, 7) is 

2)(*-3) , x(x-2)(x-2) 

- 



x(x - l)(x -3) g(ar - l)(x -2) 

"*" ^ ; 2- 1 (-1) ^ 3-2-1 

- a; 8 - 3z 2 -f 4x"- 5 

Example 10.47. Let LI(X) be the polynomial which coincides with sin x at x =* 0> 



r/6, T/3, T/2, |TT, |r, ir. Since 



7 sin a; 



dx 7 



Af , we have 



I " 4 - *f *- / 0\ t * / /| = wi 

For x = 40 or TTTT we have 



(?) () (i) (i) (j) Q Q . i (i)' 



- 

< 0.00005 
Thus the use of L(fir) would yield a value of sin 40 accurate to four decimal places. 
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Let us consider a function f(x) defined on h^x^h. We con- 

struct the polynomial Li(x\ which coincides with f(x) at x = h, 0, h. 
From (10.146) we have 



4- i(tt\ - ft) , 

r A' - 2 -- r 



and L,(x) dx = [/(-A) + 4/(0) + /(fc)] (10.151) 

y-/ t ^ 

We may well inquire how the value of (10.151) differs from / f(x) dx. 

J h 

This difference is 



= ! h 

J- 



f(x) dx- [f(-h) + 4/(0) 
o 

We note that <p(0) = 0. Let us assume that f(x\ f"(x), /'"(x), and 
f (lv) (x) exist. Differentiating <?(/&) with respect to A yields 



+ 4/(0) 

- | [/'(A) - f(-ft)] 
and ^'(0) = 0. Differentiating again yields 

-f(-A)] - I [/'(A) -/'(-A)] - | 
-/'(-A)] - [/"(A) +/"(-*)] 



so that <^"(0) = 0. Differentiating once more yields 
" " 



- /'"(-A)] 



Now 

/"(A) dA = dh dh (-I)A^'(A) dh 



If M is any upper bound of |/ (lv) (^)l on ft g x g ft, we have 



(10.152) 
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The inequality (10.152) yields an estimate of the error if we use 
Simpson's rule as given by (10.151) to approximate f f(x) dx. We 

note that Simpson's rule is exact if f(x) is a polynomial of degree less 
than or equal to 3 since f M (x) = 0. Since (10.151) depends only on 
the values of /(x) at the equally spaced points h, 0, /&, we can extend 
(10.151) to the interval a ^ x ^ b by subdividing (a, b) into an even 
number of intervals 

a, a + hj a + 2h, a + 3A, . . . , a + (n l)h, a + nh = b 
applying Simpson 's rule to the intervals 

(a, a + 2/0, (a + 2A, a + 4/0, . . . , (a + (n - 2)A, a + nA) 
and adding. This yields 

'/(x) d* = | [/(a) + 4/(a + h) + /(a + 2/01 

+ | [/( + 2/0 + 4/(a + 3*) + /(a + 4A)J + 
+ | [/(a + (n - 2)/0 + 4/(a + (n - 1)/0 + /(ft)] + 

- | [/(a) + ^(a +.A) + 2/(a + 2/i) + 4/(a + 3fc) 

+ 2/(a + 4A) + + 4/(6 - A) + /(&)] + ^ (10.153) 



... i^i^-^r^^l Mhi t. Mh\b - a) /1A1K ^ 

with || g 3 [ w J = - m nh = jgQ ' (10.154) 

M is any upper bound of |/ (lv) (x) \ on the interval a ^ x ^ b. The greater 
the number of subdivisions chosen and hence the smaller the value of A, 
the more accurate becomes Simpson's rule for approximating an integral. 

r 2 fi 

Example 10.48. If we apply Simpson's rule to In 2 = / > n ~ 10, we note that 

Ji x 

\E\ g ^^ - 6 Yon C - 1 ) 4 < 0.0002, so that we can expect to obtain a value of In 2 

accurate to at least three decimal places. Applying (10.153) yields 

ln2w ^( 1+ O + o + o + O + T^^I^ + Tj^O" f r9" f 2) 

0.69314 
In the tables one notes that in 2 0.69315 accurate to five decimal places. 

We conclude this section with a discussion of the Euler-Maclaurin sum 
formula. Let us suppose that f(x) has continuous derivatives at least to 
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the order r on the interval g x g 1. If we integrate by parts, we obtain 
f*f(x) dx = x/(x) I - / o X of (x) dx 
= /(I) - x/'(x) dx 

/(I)] + i[/(l) - /(O)] - x/'(x) ete 
/(I)] + i /'(*) dx - f* xf(x) dx 
i[/(0) +/(!)] - (x - *)/'(*) dx (10.155) 



We define B z (x) such that B' 2 (x) = x 4", #2(1) = 0. Integration yields 
Bz(x) ^(x 2 x). Replacing x ^by B((x) in (10.155) and integrat- 
ing by parts yields 

f Q l f(x) dx = i[/(0) + /(I)] + f Q l B*(x)f"(x) dx (10.156) 

We now define B s (x) such that B f 9 (x) = B 2 (x) + 6 2 with the stipulation 
that 5 3 (0) = JB 8 (1) = 0. Thus B z (x) = z 3 /6 - x 2 /4 + b 2 x + c, and 
c = 0, 62 = A- From -82^) = BS(^) ^2 we have upon integration 
by parts 

f o l f(x) dx = i[/(0) +/(!)] - 6 2 [f(l) - /'(O)] - J* Bt(x)f'"(x) dx 

This process is continued so that a sequence of polynomials (Bernoulli) 
and a sequence of constants (Bernoulli) are generated. We have 

B' k (x) = **_,(*) + 6 4 _x A; S3 
B 4 (0) = B 4 (l) = 
and 

/(I)] - 6 2 [/'(l) - /'(O)] + bj/"(l) - 



B^,(x)/ (r) (x) dx (10.158) 

In the generation of the Bernoulli polynomials and constants no mention 
was made of J5o, -Bi, 61, 60. For convenience we choose B Q (x) = 0, 
Bi(x) = x, 6 = 1, 61 = 0. 

Let us see whether it is possible to find two functions, <(x, t), <p(t), 
such that 



(10.159) 



r ~ 



r-O 
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If $(x, t) and <p(t) exist satisfying (10.159), we call them the generating 
functions of the Bernoulli polynomials and Bernoulli numbers, respec- 
tively. The series expansions are simply the Taylor series expansions 

(see Sec. 10.23), so that b r = *> (r) (0)/r!, B r (x) = 4l ff y ^) . For- 

r! \ dr /t-o 

mally we have 



r-O 



= B((x)t + B't(x)t* + B' r (x)t 



oo 

I- 



= + (x - l)^ 2 + t (B.(x) + 6. 



t&(x,t) + t<p(t) - ^ (10.160) 



so that 



Integrating with respect to x yields 

$(*,*) = A(t)e* - *(*) + ^ 



with A(0 an arbitrary function of t. Now <i>(0, t) = since fi r (0) = 0, 
so that A(<) = <p(t) ^, and 



Finally $(1, = Br(l)^ r = * since JS^l) = 1, S r (l) = 0, r ^ 1. 
Hence [*>(0 - -}/](e - 1) = 1, and 



( 10 - 161 > 



One can now consider <p(f) and $(a;, t) of (10.161) and justify the series 
expansions (10.159) along with the term-by-term differentiation given in 
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(10.160). One then easily shows that (10.157) is valid and that B (x) => 0, 
BI(X) = x, feo = 1, bi = 0. We omit proof of this fact. An important 
property of the Bernoulli constants is discernible if we note that 

* te * 



- - _ - - 

- 2 *-'-!"" 2 - 1 

t . . . __ * * , * 

__- + , + ____ + __ 

Hence <p(t) is an even function so that &2r+i = for all r [see (10.159)]. 
If we take r = 2s + 1 and apply (10.157) to (10.158), we obtain 



f 



l f(x) dx = i[/(0) +/(!)] - W(D ~ /'(O)] 



-/'"(O)] ----- ^[/" 

B 2 . + iW/ <2 ' +1) W dx (10.162) 



If we apply (10.162) to g(x) = f(x + 1), we obtain 
g(x) dx = f(x + l)dx = *f(x) dx 



Continuing this process for the intervals (2, 3), (3, 4), . . . ,( !, n) 
and adding yields the Euler-Maclaurin formula, 



(x) dx = 

+ i/(n) - 6 2 [/'(n) - /'(O)] - 64[/'"(n) - /'"(O)] 
- 6i.[/ (1 -(n) -/<- (0)] 



dx (10.163) 

The Bernoulli polynomials and constants of (10.163) can be calculated 
from (10.157) or (10.161). It can be shown that if / (28+1) (x) is monotonic 
decreasing for ^ x ^ n and if the odd derivatives of f(x) have the same 

sign on ^ x ^ n, then the true value of f n f(x) dx always lies between 

the sums of s and s + 1 terms of the series following ^/(O) +/(!)+* 
+ f(n 1) + /(w) in (10.163). For this case we need not be concerned 
with 

n-l 

2 / m m+1 Bw(x - nW-(x) dx 

m-0 

Example 10.49. Stirling's Formula. For f(x) In (x -f 1) we note that 



(x 
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for x 2a n and / (lrhl >(aB) is monotonic decreasing on ^ a; ^ n. Integration by 

parts yields 

(n + 1) In (n + 1) - n - f * In (x + 1) dx - In 1 + In 2 + In 3 4- 

+ ln (n - 1) +*ln n - 6, (^ - l) - 6 4 (^fyy, - 2) ---- (10.164) 
Now 
(n + 1) In ( n + 1) - (n + 1) In [n (l + ^)] - (n + 1) In n 

+ (n + I) In (l + ~ 
and, for large w, (n H- 1) In (n -H 1) (n + 1) In ft> since 



Urn (n + 1) In (l + ) - 1 
n * oe ^ w / 



Neglecting those terms which tend to zero as n > , we note from (10.164) that 

(n + T) ln n ~ n ln n! + C (10.165) 

where C is the sum of the constant terms in (10.164). Let the reader apply (4.47), 
with (10.165) to show that C - In 2*. Thus 

, n! 

In - ; - n 
n n v 2am 

n! V2irnn n e- ft = V^m (-) (10.166) 



Formula (10.166) is Stirling's formula for approximating w' for n large. The true 
meaning of (10.166) is that 

lim ' 



Problems 

1. Find a root of x* 3 -f 1 =0 lying between x and x = 1, accurate to six 
decimal places. 

2. Find a root of x In (x + 1) 4- 1, accurate to five decimal places. 

3. Find a polynomial p(x) such that p(0) - 0, p(l) 1, p(2) - 2, p(3) 3, 
p(4) - 11. 

4. Apply Simpson's rule with 10 subdivisions to approximate / - -- Do the 

same for / , How accurate can you expect your answers to be? 

Jo V% - sin x 

6. Show that 6 4 - -yiir, B^(x) - (x* - 2x* + x*). 



6. Show that C - - In 2*- [see (10.165)]. 

10.26. The Lebesgue Integral. It is rather obvious that not all func- 
tions are Riemann-integrable (J?-integrable). We consider the interval 
g x g 1 and define /(x) = 1 if x is irrational, f(x) = 2 if x is rational. 
The upper Darboux integral is seen to be 2, whereas the lower Darboux 
integral is 1 (see Sec. 10.16). Thus f(x) is not fr-integrable on ^ x ^ 1. 
However, let us consider the following: It is known th&t the rationals are 
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countable (see Sec. 10.11), so that we can enumerate the rationals on the 
interval g x g 1, and we designate the rationals by 

fi, 7*2, r 3 , . . . , r n , . . . 

We define /i(x) on g x g 1 by /i(z) = 1 f or x * n, /i(n) = 2. We 
define / 2 (x) by / 2 (z) = 1, x 5^ ri, r 2 ,/(ri) = /(r 2 ) = 2. Proceeding in this 
manner, we construct a sequence of functions 

/i(*),/(*), - - ,/(*), - (10.167) 

with / n (a;) = 1 except at x = ri, r 2 , . . . , r n , and at these rational points, 
f n (x) = 2. It is apparent that 



lim f n (x) = /(x) = 1 for x irrational 
2 f or x rational 

Moreover the sequence \f n (x) } is monotonic nondecr easing as n increases, 
and we write f n (x) /*/(z). The reader can easily prove that every func- 

tion of the sequence \f n (x) } is fi-integrable, with I f n (x) dx 1 for all n. 
We define the Lebesgue integral of f(x) as 

L(jf) = f l f(x)dx SE lim f l f n (x)dx = lim 1 = 1 (10.168) 

JQ n w JQ n- 

Even though f(x) is not /Wntegrable, it is Lebesgue integrable (Z/-inte- 
grable), and the value of its Lebesgue integral is defined by (10.168). 

More generally, we have the following situation : Let C be the class of 
all J?-integrable functions on the range a ^ x ^ 6, and let f(x) be the 
limit of a monotonic nondecreasing sequence, {/ n (#) } , of functions belong- 
ing to C. We define the Lebesgue integral of f(x) by 

L(/) = f b f(x)dx = lim f b f n (x)dx (10.169) 

jo, n-t* Ja 

provided the limit exists. Let C denote the class of all such functions 
which are L-integrable. Any function which is J?-integrable is automati- 
cally L-integrable since /(x) = f n (x) /" f(x) and 



lim f b f(x)dx = f b f(x)dx 

n-K J" J* 



Thus C is a proper subset of C. Any function which is JB-integrable is 
L-integrable, and the values of the two integrals are the same. It can 
be shown that if /(x) is a limit of a monotonic nondecreasing sequence of 
functions belonging to C then/(x) also belongs to C. An analogous situ- 
ation occurs in the real-number system. We define the irrational num- 
bers as the limits of bounded sequences of rational numbers. The limit 
of a bounded sequence of irrational and/or rational numbers is again a 
real number, rational or irrational. 
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To extend the class C in order to embrace a larger class of functions, 
we consider the following: A function f(x) is said to belong to the class 
Ci if for any e > we can find two functions, g(x) and h(x), belonging to 
C such that 

g(x) S f(x) h(x) for a x g 6 
and 



If /(x) belongs to Ci we define its Lebesgue integral by 

(/) m f b f(x) dx = supLfo) = inf L(h) (10.171) 

J* geC h*C 

for all g(x) and h(x) satisfying (10.170). The reader can easily verify 
that if f(x) belongs to C then f(x) belongs to Ci and L(f) = <(/). The 
set Ci contains all Lebesgue-integrable functions. 

The ideas contained above lead to the concepts of measurable func- 
tions and measurable sets. In most texts one begins with the definition 
of the measure of a set, and afterward the Lebesgue integral is defined. 
We omit a discussion of measure, but we do emphasize that the theory 
of measure is of prime importance in modern probability theory. The 
Lebesgue integral plays an important role in modern mathematical analy- 
sis with its applications to the Fourier integral, probability theory, ergodic 
theory, and other fields. 
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A posteriori probability, 345 

A priori probability, 345 

Abelian group, 300 

Abel's lemma, 450 

Abel's test for uniform convergence, 451 

Absolute convergence, 444, 455 

Absolute tensors, 89 

Absolute value of complex number, 123 

Acceleration, 50, 52 

Addition, parallelogram law of, 38, 123 

of tensors, 89 

of vectors, 38 
Affine transformation, 313 
Aleph zero, 394 

Algebraic extension of a field, 324-326 
Algebraic numbers, 396 
Alternating series, 443 
Amplitude of complex number, 123 
Analytic continuation, 150-151 
Analytic function, 130 
Angular momentum, 115 
Angular velocity, 46 
Arc length, 52 

Archimedean ordering postulates, 385 
Arcs, rectifiable, 277 
Argand plane, 123 
Argument of complex number, 123 
Arithmetic n-space, 84 
Associated Legendre differential equa- 
tion, 230 

Associated vector, 94 
Automorphism, 304 

of a field, 331 

of a group, 304 

inner, 305 
Axiom of choice, 392 



Bayes's formula, 346 
Bayes's theorem, 344-346 
Bernoulli numbers, 475-477 
Bernoulli polynomials, 475-477 
Bernoulli trials, 349 
Bernoulli's theorem, 358 
Bessel function, 232 
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Bessel's equation, 231 
Bessel's inequality, 247, 258 
Beta function, 170 
Bianchi's identity, 106 
Bienaym^-Tchebysheff inequality, 355 
Bilinear transformation, 125 
Binomial coefficient, 339 
Binomial theorem, 339 
Binormal, 53 

"Bits" of information, 346 
Bonnet, second law of the mean for inte- 
grals, 457 
Boole, G., 388 

Boolean operator, 6 = z ,221 
dz 

Boundary of a set, 389 
Boundary point, 387 
Bounded set, 386 
Bounded variation, 277 
Brachistochrone, 289, 293 
Branch point, 149 
Buffon needle problem, 355 



Calculus of variations, 288 
problem of constraint in, 295 
variable end-point problem in, 293 

Cancellation, law of, 375 

Cantor set, 396 

Cantor's theory of sets, 394r-396 

Cardinal numbers, 394 

Cartesian coordinate system, 93 

Cauchy, convergence criterion for se- 
quence, 437 
inequality of, 391 
integral formula of, 143 
integral theorem of, 136 
nth-root test of, 442 

Cauchy's distribution function, 361 

Cauchy-Riemann equations, 148 

Cayley's theorem, 303 

Central of a group, 307 

Central-limit theorem, 359-361 

Character of a group, 312 

Characteristic function, 357 
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Characteristic roots, 24 
Characteristic vector, 24 
Chi-equared distribution, 362-364 
Christoffel symbols, 96 

law of transformation of, 98 
Christoffel-Darboux identity, 244 
Closed interval, 386 
Closed set, 386, 388 

of orthogonal polynomials, 249 
Closure of a set, 389 
Coefficients, Fourier, 245, 252 
Cofactor, 9 
Combinations, 338 
Commutative law, of integers, 375 

of vector addition, 38 
Commutator, 303 
Comparison test for series, 441 
Complement of a set, 387 
Complete set of orthogonal polynomials, 

249-250 
Complex functions, 125 

continuous, 126 

differentiability of, 127 

(See also Complex variable) 
Complex-number field, 122 
Complex numbers, 122 

absolute value of, 123 

argument of, 124 

conjugate of, 124 

modulus of, 123 

vector representation of, 123 
Complex variable, functions of, 125 
analytic or regular, 130 
derivative of, 127 
integration of, 131-136 
Taylor's expansion of, 145 
Components, of a tensor, 89 

of a vector, 41 
Composition, series of, 311 

prime factors of, 311 
Conditional probability , 345 
Conditionally convergent series, 444 
Confluence of singularities, 223 
Conjugate of a complex number, 124 
Conjugate elements of a group, 306 
Conjugate numbers, 325 
Conjugate subgroups, 306 
Conservative vector field, 67 
Continuity, 126, 398 

uniform, 399 
Continuous function, 398 
Continuous transformation groups, 314- 

315 

Contour integration, 160-162 
Contraction of a tensor, 90 
Contravariant tensor, 89 



Convergence, absolute, 444, 455 

conditional, 444 

of Fourier series, 251, 256 

interval of, 386, 453 

of power series, 453-454 

radius of, 146, 453 

of series, 439 

tests for, 440-444 

Cauchy's nth-root, 442 
D'Alembert's ratio, 441 
integral, 441 

uniform, 446, 450 
tests for, 451-453 

Abel's, 451 

Coordinate curves, 95 
Coordinate systems, 40, 84 
Coordinates, cartesian, 93 

curvilinear, 62, 65 

cylindrical, 65 

Euclidean, 93 

geodesic, 104 

normal, 32 

Riemannian, 105 

spherical, 63 

Laplacian in, 103 

transformation of, 84, 411 
Cosets, 305 
Cosh 2, 150, 158 
Cosine 2, 131 
Countable collection, 394 
Co variant curvature tensor, 106 
Covariant derivative, 100 
Covariant differentiation, 100 
Covariant tensor, 89 
Covariant vector, 87 
Cramer's rule, 10 
Craps, game of, 348 
Criterion, of Eisenstein, 328 

of Lipschitz, 183 

Cross or vector product of vectors, 45 
Curl, 59 

of a gradient, 60 

of a vector, 59, 83, 102 
Curvature, radius of, 53 

scalar, 106 

Curvature tensor, 106, 108 
Curve, unicursal, 435 

(See also Space curve) 
Curves, family of, 174 
Curvilinear coordinates, 62 

curl, divergence, gradient, Laplacian 

in, 65 
Cyclic group, 300 



D'Alembert's ratio test, 441 
Darboux integral, 419 
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Definite integral, 131, 419-424 

mean-value theorems for, 422, 457 
Del (V), 57 

De Moivre, formula of, 125 
Density or weight function, 237 
Dependence, linear, 20, 187 
Derivative, 402 

co variant, 100 

of a determinant, 8 

of an integral, 142 

of matrix, 13 

partial, 408 

of a vector, 50 
Desargues' theorem, 40 
Determinant, cof actors of, 9 

derivative of, 8 

expansion of, 9 

of matrix, 5, 13 

minors of, 10 

order of, 6 

Vandermonde, 334 

Wronskian, 187 
Determinants, 5 

multiplication of, 9 

properties of, 7-10 

solution of equations by, 1 
Diameter of a set, 390 
Dice (craps), 348 
Differential, 402 

operator, 410 

total, 409 
Differential equations, 172 

Bessel's, 231 

in complex domain, 201 

definition of, 172 

degree of, 175 

exact, 178 

first-order, 175 

homogeneous, 177 

hypergeometric, 221 

integrating factors of, 1 79 

Laplace's, 226 

Legendre's, 201, 222 

linear (see Linear differential equa- 
tions) 

nonlinear, 271 

partial (see Partial differential equa- 
tions) 

second-order, 199-202 

separation of variables in, 175 

simultaneous, 190 

singular point of, 207 

solutions of, existence of, 183 
general, 182, 191 
particular, 182 
uniqueness of, 185 

Sturm-Liouville, 200 



Differential equations, system of, 190 
wave, 172, 286 

Differential operator, s = , 192 
ax 

Differentiation, 402 

covariant, 100 

of Fourier series, 262 

implicit, 403 

partial, 408 

rules, 61, 403 

of series, 450 

of vectors, 50, 61 
Diffusion equation, 367-368 
Dini's conditions, 457-459 
Direction cosines, 43 
Directional derivative, 56 
Discontinuity of a function, 398 
Distribution, chi-squared, 362-364 

Gaussian, 353 

moment of, 352 
Distribution function, 351 

Cauchy's, 361 
Distributive law, 376 
Divergence, 58, 102 

of a curl, 61 

of a gradient, 59 

of series, 439 

Divergence theorem of Gauss, 75 
Dot, or scalar, product, 42, 88 



Eigenfunction, 24 

Eigenvalue, 24 

Eigenvector, 24 

Einstein, Albert, summation convention 

of, 1 

Einstein-Lorentz transformations, 320 
Eisenstein, criterion of, 328 
Electric field, 165 
Electrostatic potential, 76, 165 
Elliptic integrals, 436 
Entire functions, 145 
Equally likely events, 342 
Equation, Bessel's, 231 

differential (see Differential equations) 

Euler-Lagrange, 290 

exact, 178-179 

indicial, 210 

integral, 203 

Laplace's, 76, 78, 226 

Legendre's, 230, 404 

linear homogeneous, 17 

principal, 326 

wave, 172, 286 
Equations, Cauehy-Riemann, 128 

of motion, Euler's, 115 
Hamilton's, 113 
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Equations, of motion, Lagrange's, 110 
solution by determinants, 10 
systems of, 2, 4 

homogeneous linear, 17 

Equivalence relation, 376 

Error, in Simpson's rule, 473-474 
in Taylor's series, 462 

Essential singularity, 157 

Euclidean coordinates, 93 

Euclidean space, 93, 107 

Eudoxus, 385 

Euler's equations of motion, 115 

Euler-Lagrange equation, 290 

Euler-Maclaurin sum formula, 474-478 

Even function, 256 

Exact equation, 178-179 

Expansions, in Fourier series, 252, 260 
in Maclaurin's series, 462 
in orthogonal polynomials, 246, 251 
in Taylor's series (see Taylor's series or 
expansion) 

Expectation, 354 

Exterior point, 387 

Extrerna of functions, 466-468 



Factor, integrating, 179 
Factor group, 307-310 

index of, 308 
Factorials, 168-171 

Stirling's formula for, 477-478 
Family of curves, 174 
Field, 122, 323, 380 

automorphisms of, 331 

electric, 165 

extension of, algebraic, 324-326 
normal, 330 

of probability, 343 

of sets, 342 

steady-state, 41 

vector, conservative, 67 
irrotational, 69, 73 
solenoidal, 77 
steady-state, 41 
uniform, 42 

First-order differential equation, 175 
First theorem of the mean, 422 
Flat space, 107 
Force moment, 116 
Formula, Cauchy's integral, 143 

interpolation, of Lagrange, 471 

Stirling's, 477-478 
Fourier coefficients, 245, 252 
Fourier integral, 265 
Fourier partial sum, 245, 258 



Fourier partial sum, minimal property 

of, 248 

Fourier remainder, 247 
Fourier series, convergence of, 251, 256 

differentiation of, 262 

expansion in, 252, 260 

integration of, 264 

partial, minimal property of, 248 

trigonometric, 252 
Fourier transform, 268 
Frenet-Serret formulas, 52-53 
Frobenius, method of, 215 
Function, analytic, 130 

Bessel, 232 

beta, 170 

of bounded variation, 277 

characteristic, 357 

of a complex variable (see Complex 
variable) 

conjugate, 124 

continuous, 126, 398 
piecewise, 258 
sectionally, 258 
uniformly, 399 

density, or weight, 237 

differentiate, 127, 402 

discontinuous, 398 

distribution, 351 

entire, 145 

even, 256 

expansion of, 145 

extrema of, 466-468 

factorial, or gamma, 168 

generating, 234, 476 

homogeneous, 177 

hypergeometric, 222, 223 

integrable, Lebesgue, 478-480 
Riemann, 419-424 
square, 246, 257 

limit of, 397-398 

memory, 270 

of more than one variable, 407 

odd, 256 

potential, 111 

probability, 352 

rational, 434 

of a real variable, 396 

regular, 130 

symmetric, 320, 321 

transfer, 270 
Functional, 292 
Functions, orthogonal, 237 

(See also Complex functions) 
Fundamental theorem, of algebra 
(Gauss), 145 

of arithmetic, 381 

of the integral calculus, 142, 423 
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Galois group, 332 

Galois resolvent, 330 

Game theory, 368-372 

Gamma function, 168-171 

Gauss divergence theorem, 75 

Gauss fundamental theorem of algebra, 

145 

Gaussian distribution, 353 
General solution of differential equations, 

182, 191 

Generating function, 234, 476 
Geodesic coordinates, 104 
Geodesies, in a Riemannian space, 95 

of a sphere, 289 
Gradient of a scalar, 55 
Greatest common divisor, 323 
Green's theorem, 77 
Group, Abelian, 300 
automorphism of, 304 

inner, 305 
central of, 307 
character of, 312 
conjugate elements of, 306 
conjugate subgroups of, 306 
continuous transformation, 314-315 
cyclic, 300 
definition of, 298 
factor, 307 

index of, 308 
finite, 300 
Galois, 332 
octic, 312 
order of, 301 
quotient, 307 

index of, 308 
regular permutation, 303 
representation of, 313 
irreducible, 314 
reducible, 314 
simple, 307, 310 
subgroup of, 301 
invariant, 306 
normal, 306 

maximal, 309 
symmetric, 302 
Groups, intersection of, 307 
isomorphism of, 303 
prime-factor, 311 
product of, 308 



Hamilton-Cayley theorem, 35 

Hamiltonian, 113 

Hamilton's equations of motion, 113 

Harmonic series, 440 

Heat flow, 227 

Heine-Borel theorem, 392-393 



Helix, 53 

Hermite polynomials, 233, 239, 241 

generating function for, 234 
Hermitian matrix, 34 
Hessian, 419 
Hilbert space, 391 
Homogeneous function, 177 
Homogeneous linear equation, 17 
Hyperbolic cosine, 150, 158 
Hyperbolic sine, 150 
Hypergeometric equation, 221 
Hypergeometric function, 222 

of Kummer, 223 
Hypersurface, 95 



Identity theorem, 151 
Impedance, 270 
Implicit differentiation, 403 
Implicit function theorems, 412-419 
Improper integral, 282, 427 

Cauchy value of, 282 
Independence, linear, 187 
Independent events, 347-348 
Index of factor group, 308 
Indicial equation, 210 
Inertia, moment of, 117 

tensor, 116 
Infemum, 383 
Infinite integral, 428 

uniform convergence of, 429 
Infinite series, of constants, 439 

convergence of, absolute, 444, 455 
conditional, 444 
test for, 440-444 
uniform, 450-454 

expansion in, 246, 252, 461, 465 

of functions, 449 

theorems concerning, 447-450 

of trigonometric functions, 252 
Infinitesimal, 397 
Infinitesimal transformation, 315 
Infinity, point at, 158, 218 
Information theory, "bits" of informa- 
tion, 346 
Integers, commutative law of, 375 

positive, 374r-377 

properties of, 380-381 

rational, 378-379 

Integrable-square functions, 246, 257 
Integral, Darboux, 419 

definite (see Definite integral) 

derivative of, 142 

elliptic, 436 

equation, 203 

formula of Cauchy, 143 

Fourier, 265 
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Integral, improper, 427 

infinite, 428, 429 

Lebesgue, 478-480 

line, 66 

mean-value theorem, 422, 457 

Riemann, 131, 419 

Stieltje, 280 

test for convergence, 441 

theorem of Cauchy, 136 
Integral equation of Volterra, 203 
Integrals with a parameter, 142, 425-426 
Integrating factors, 179 
Integration, of a complex variable, 131- 
136 

contour, 160-162 

of Fourier series, 264 

of Laplace's equation, 78 

methods of, 431-436 

numerical, 474-477 

by parts, 143, 424 

of a real variable, 419-420 

of series, 450, 458 

term-by-term, 448, 458 
Interior point, 387 
Interlacing of zeros, 199, 244 
Interpolation formula of Lagrange, 471- 

472 

Intersection of sets, 388 
Interval, closed, 386 

of convergence, 386, 453 

open, 386 
Invariant, 320 

subgroup, 306 
Irreducible polynomial, 324 
Irrotational vector field, 69, 73 
Isomorphism, 1585 

of groups, 303 
Isoperimetric problem, 297 



Jacobian, 62 
Jordan curve, 131 
Jordan-Holder theorem, 310-312 



Kepler's first law of planetary motion, 52 
Kinetic energy, 111 
Kronecker, 324 
Kronecker delta, 3, 9 
Kryloff-Bogoluiboff, method of, 271 
Kuinmer's confluent hypergeometric 
function, 223 



Lagrange's equations of motion, 110 
Lagrange's interpolation polynomial, 
471-472 



Lagrange's method of multipliers, 296, 467 

Lagrangian, 111 

Laguerre polynomials, 224, 239, 241 

associated, 224 

ordinary, 226 
Laplace transformation, 282 

inversion theorem of, 287 

table of, 285 
Laplace's equation, 76, 226 

integration of, 78 

solution of, 226 
Laplacian, 59, 103 

in cylindrical coordinates, 65 

in orthogonal coordinate system, 65 

in spherical coordinates, 103 

in tensor form, 103 
Laurent expansion, 154 
Law of the mean, for differential calcu- 
lus, 405 

for integral calculus, 422, 424, 457 
Lebesgue integral, 478-480 
Legendre polynomial, 230, 239, 404 

associated, 230 

Legendre's duplication formula, 169 
Legendre's equation, 230, 404 
Leibniz, rule of, 403 
Length of arc, 52, 95 
L'HospitaPs rule, 405 
Limit, 397-398 
Limit cycle, 273 
Limit point, 386 
Line element, 93 
Line integral, 66 
Linear dependence, 20, 187 
Linear differential equations, with con- 
stant coefficients, 191 

definition of, 175 

exact, 178 

existence theorem of, 183 

first-order, 175, 177 

general solution of, 182, 191 

second-order, 199, 202 
indicial equation, 210 
ordinary point, 202 
properties, 199-201 
regular singular point, 210 

systems of, 190 
Linear equations, 2 

homogeneous, 17 
Linear independence, 187 
Liouville's theorem, 145 
Lipschitz criterion, 183 
Ln 2, 148 



M test of Weier&trass, 451 
Maclaurin series or expansion, 462 
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Matrices, 11 

comparable, 12 

equality of, 11 

multiplication of, 12-13 

sum of, 12 
Matrix, characteristic equation of, 24 

column, 20 

derivative of, 13 

determinant of, 5, 13 

Hermitian, 34 

identity, 14 

inverse of, 16, 29-30 

multiplication by a scalar, 12 

negative of, 12 

orthogonal, 22 

skew-symmetric, 14 

spur of, 15 

square, 11 

nonsingular, 16 

subdivision of, 32 

symmetric, 14 

trace, 15 

transpose of, 14 

triangular, 28 

unitary, 22 

zero, 12 

Maximal normal subgroup, 309 
Maximum-modulus theorem, 151 
Mean, law of, 405, 422, 424, 457 
Mean or expected value, 405 
Memory function, 270 
Methods of integration, 431-436 
Metric tensor, 93 
Minimal property of partial Fourier 

series, 248 
Mixed strategy, 370 
Moment, of a distribution, 352 

force, 116 

of inertia, 117 
Momentum, angular, 115 

generalized, 113 
Monodromy theorem, 153 
Monte Carlo methods, 364-368 
Xlorera's theorem, 145 
Motion, equations of (see Equations) 

in a plane, 52 

of rigid body, 46-47 
Multipliers of Lagrange, 296, 467 
Mutually exclusive events, 343 



Navier-Stokes equation of motion, 117 
Neighborhood, deleted, 387 

spherical, 391 

Nested sets, theorem of, 392 
Newton's binomial expansion, 339 
Nonessential singular point, 157 



Nonlinear differential equations, 271 
Normal to a surface, 56, 71 
Normal acceleration, 52 
Normal coordinates, 32 
Normal derivative, 56 
Normal distribution of Gauss, 353 
Normal extension of a field, 330 
Normal subgroup, 306 

maximal, 309 
Normalizing factors, 240 
Null set, 389 
Numbers, algebraic, 396 

Bernoulli, 475-477 

cardinal, 394 

complex, 122-124 

conjugate, 325 

primitive, 328 

real, 382 

triples, 81 

Numerical integration, 474-477 
Numerical methods, 469-478 



Octic group, 312 

Odd function, 256 

One-to-one correspondence, 394 

Open interval, 386 

Open set, 387 

Operator, of Boole, 221 

of continuation, 154, 207 

differential, 192 
vector, 57, 60 
Ordered set, 377 
Ordering theorems, 377 
Ordinary differential equation, 172 
Ordinary point, 202 
Orthogonal curvilinear coordinates, 62 
Orthogonal functions, 237 
Orthogonal polynomials (see Poly- 
nomials) 

Orthogonal trajectories, 176 
Orthogonal transformations, 22 
Orthogonal vectors, 82 
Orthonormal set, 241 



Paperitz, 221 

Paraboloidal coordinates, 95 
Parallel displacement, 108 
Parallel vectors, 38, 108-109 
Parallelogram law of addition, 38, 123 
Parameters, integrals containing, 425 

variation of, 197 
Parseval's identity, 265 
Partial derivative, 408 
Partial differential equations, Fourier's 
method of solving, 226-228 



ELEMENTS OF PUKE AND APPLIED 'MATHEMATICS 



Partial differential equations, heat equa- 
tion, 368 

Laplace's equation, 226 
wave equation, 172, 286 
Partial differentiation, 408 
Partial fractions, 431-432 
Partial sums, 245, 258 
Particular solutions, 182, 194-198 
Peano's postulates, 374 
Pearson, K., x* distribution of, 363 
Permutations, 337-338 
Picard's method of successive approxi- 
mations, 184, 203 
Picard's theorem, 157 
Piecewise continuous function, 258 
Point, boundary, 387 
branch, 149 
exterior, 387 
at infinity, 158, 218 
interior, 387 x 
limit, 386 
neighborhood of, 387 

deleted, 387 
ordinary, 202 
set theory, 386-389 
singular, 156, 157 

of differential equation, 207 J* 
essential, 157 
nonessential, 210 
regular, 210 
removable, 157 
Poker hands, 338 
Poles, simple, 157 
Polynomials, 322 
BernouUi, 475-477 
Hermite, 233, 234, 239, 241 
irreducible, 324 

Lagrange's interpolation, 471-472 
Laguerre, 224, 226, 239, 241 
Legendre, 230, 239, 404 
normal, 330 
orthogonal, 237 
closed sets of, 249 
complete sets of, 249 
completeness of, 250 
difference equation for, 243 
expansions in, 246, 251 
zeros of, 242 
relatively prime, 324 
separable, 328 
Tchebysheff, 241 

Positive-definite quadratic form, 28 
Positive integers, 374-377 
Potential, electrostatic, 76, 166 
Potential function, 111 
Power series, 453 
convergence of, 453 



Power series, convergence of, radius of, 

453 
uniform, 453-454 

differentiation of, 454 

expansions in, 462 

functions defined by, 151 

integration of, 455 

solutions of differential equations by, 

206, 212-217 
Pressure, 119 

Prime factors of composition, 311 
Primitive number, 328 
Principal directions, 118 
Principal equation, 326 
Probability, a posteriori, 345 

a priori, 345 

axiomatic definition of, 343 

central-limit theorem of, 359-361 

conditional, 345 

continuous, 351 

field of, 343 

function of, 352 
distribution, 351 

Gauss, distribution of, 353 

joint, 355 
Pursuit problem, 54 



Quadratic forms, 21 
positive-definite, 28 

Quotient group, 307 
index of, 308 

Quotient law of tensors, 90 



Raabe's test, 442 

Radius, of convergence, 146, 453 

of curvature, 53 
Random variable, 351 
Random-walk problem, 350, 367 
Ratio test of D'Alembert, 441 
Rational function, 434 
Rational integers, 378-379 
Rational numbers, 379-380 
Real-number system, 382-385 
Recapitulation of vector differential for 

mulas, 61 

Reciprocal tensors, 92, 94 
Rectifiabte path, 277 
Recursion formula, 202 
Regular function, 130 
Regular permutation group, 303 
Regular singular point, 210 
Relative tensors, 89 
Relative velocity, 54 
Remainder in Taylor series expansion, 

462 
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Residue, 158, 160 
Ricci tensor, 106 

Riemann integral, complex-variable 
theory, 131 

real-variable theory, 419-424 
Riemann surface, 149 
Riemann-Christoffel tensor, 106 
Riemannian coordinates, 105 
Riemannian curvature tensor, 106 
Riemannian metric, 93 
Riemannian space, 93 

geodesies in, 95 
Rigid body, motion of, 46-47 
Ring, 379 

Rolle's theorem, 404 
Rule, Cramer's, 10 

Simpson's, 473-474 
Russell number, 382-385 



Saddle element, 369 
Scalar, 37 

gradient of, 55 

Laplacian of, 59, 103 
Scalar curvature, 106 
Scalar product of vectors, 42, 88 

triple, 47 

Schwarz-Cauchy inequality, 391 
Schwarz-Christoffel transformation, 163 
Second law of the mean, 457 
Second-order differential equations, 199, 
202 

properties of, 199-201 
Sectionally continuous function, 258 
Separation of variables, method of, 227 
Sequence, 437 

convergence of, Cauchy criterion for, 
437 

convergent, 437 

uniformly, 446 
Sequential approach, 391 
Series, 439 

alternating, 443 

comparison test for, 441 

of composition, 311 

convergent, 439 

differentiation of, 450 

divergent, 439 

Fourier (see Fourier series) 

harmonic, 440 

infinite (see Infinite series) 

integration of, 450, 458 

Maclaurin, 462 

oscillating, 439 

power (see Power series) 

sum of, 439 



Series, Taylor's (see Taylor's series or 
expansion) 

uniformly convergent, 450 
Set, boundary of, 389 

bounded, 383, 386 

Cantor, 396 

closed, 386, 388 

closure of, 389 

complement of, 387 

countable, 394 

denumerable, 394 

diameter of, 390 

infemum of, 383 

limit point of, 386, 389 

nondenumerable, 394 

null, 389 

open, 387 

ordered, 377 

orthonormal, 241 

supremum of, 383 

totally ordered, 377 

uncountable, 394 
Sets, Cantor's theory of, 394-396 

field of, 342 

intersection of, 388 

nested, theorem of, 392 

un^| of, 388 
Simple closed curve, 131 
Simple group, 307, 310 
Simple mappings, 124 
Simpson's rule, 473-474 
Simultaneous differential equations, 190 
Sine z, 131 

Singular points (see Point) 
Singularities, confluence of, 223 
Sinh 2, 150 
Slope, 402 
Space curve, 52 

arc length of, 52 

curvature of, 53 

Jordan, 131 
tangent to, 52 

torsion of, 53 

unit binormal of, 53 

unit principal normal of, 53 
Spherical coordinates, 63 
Steady-state vector field, 41 
Stieltje's integral, 280 
Stirling's formula, 170, 477-478 
Stochastic variable, 351 
Stokes's theorem, 70 
Strain tensor, 118 
Stress tensor, 118 
Sturm-Liouville equation, 200 
Subgroups, conjugate, 306 
invariant, 306 
normal, 306 
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Rub -act*''' i .tors, 38 
Su> of a t^ 439 

t ^pnvention, 1 
fe, v l 

.premum, 383 
orface, 70 
normal to, 56, 71 
Symmetric functions, 320 

fundamental theorem of, 321 
Symmetric group, 302 
System of equations, 4 



Tan- 1 z y 150 

Tangent to a space curve, 52, 84 
Tauber, example of, 455 
Taylor's series or expansion, 145, 461- 
465 

for functions of n variables, 464-465 

remainder in, 462 

uniqueness of, 147 
Tehebysheff's theorem, 354 
Tensor, components, 89 

contraction, 90 

contravariant, 89 

covariant, 89 

curvature, 89 

density, 89 

inertia, 116 

iso tropic, 91 

metric, 93 

mixed, 89 

Ricci, 106 

Riemann-Christoffel, 106 

strain, 118 

stress, 118 

weight of, 89 
Tensors, absolute, 89 

addition of, 89 

product of, 89 

quotient law of, 90 

reciprocal, 92, 94 

relative, 89 
Termwise differentiation and integration, 

448-450, 458 

Theorem of the mean, differential cal- 
culus, 405 

integral calculus, 422, 423, 457 
Theory of games, 368-372 
Torque, 116 

Torsion of a space curve, 53 
Total differential, 409 
Totally ordered set, 377 
Trajectories, orthogonal, 176 
Transfer function, 270 



Transform, Fourier, 268 

Laplace, 282 
Transformations, affine, 313 

bilinear, 125 

continuous group, 314-315 

infinitesimal, 315 

Laplace (see Laplace transformatw 

orthogonal, 22 

Schwarz-Christoffel, 163 
Triangular matrix, 28 
Trigonometric series, 252 
Trihedral, 41, 53, 63 
Triple scalar product, 47 
Triple vector product, 48 



Uncountable set, 394 
Undetermined coefficients, method of 

194 

Unicursal curve, 435 
Uniform continuity, 399 
Uniform convergence, Abel's test for, 

of a sequence, 446 

of a series, 450 

Weierstrass M test for, 451 
Union of two sets, 388 
Unitary matrix, 22 



Vandermonde determinant, 334 
Van der Pol's equation, 273 
Variation, bounded, 386 

calculus of (see Calculus of variatioi 

of parameters, 197 
Vector, 37 

associated, 94 

binormal, 53 

characteristic, 24 

column, 20 

components of, 41, 81 
physical, 87 

conservative, 67 

contravariant, 84 

curl of, 59, 83, 102 

derivative of, 50 

divergence of, 58, 102 

irrotational, 69, 73 

length of, 37, 94 

normal, 53 

operator del (V), 57 

parallel displacement of, 108 

potential, 77 

solenoidal, 77 

space, 41 

tangent, 52 

unit, 37, 81 

zero, 37 
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otor field (see Field) 

- tors, addition of, 38 
-ngle between, 42 

Hfferentiation of, 50, 61 
tquality of, 37 

linear combination of, 39 

orthogonal, 82 

parallel, 38, 108-109 

scalar or dot product of, 42, 88 

subtraction of, 38 

triple scalar product of, 47 

triple vector product of, 48 

unit, fundamental, 41 

vector or CTORS product of, 45 

elocity, 50 

angular, 46 
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Velocity, relative, 
Volterra integral e& j^ 

Wave equation, 172, 4. 
Weierstrass approximat thuW 1 ^ 250 
Weierstrass M test, 451 "** " 4 

Weierstrass-Bolzano theorem, 389 L *' 
Work, 112 '<- 

Wronskian, 187 

Zermelo postulate, 392 
Zero, Aleph, 394 
Zero matrix, 12 
Zero sum game, 368 
Zeros, interlacing of, 199, 244 
of orthogonal polynomials, 242 



